Optimized Dynamical Decoupling via Genetic Algorithms 



Gregory Quiroz 

Department of Physics and Center for Quantum Information Science & Technology, 
University of Southern California, Los Angeles, California 90089, USA 

Daniel A. Lidar 

Departments of Electrical Engineering, Chemistry, and Physics, 
and Center for Quantum Information Science & Technology, 
University of Southern California, Los Angeles, California 90089, USA 

We utilize genetic algorithms to find optimal dynamical decoupling (DD) sequences for a single-qubit system 
subjected to a general decoherence model under a variety of control pulse conditions. We focus on the case of 
sequences with equal pulse-intervals and perform the optimization with respect to pulse type and order. In this 
manner we obtain robust DD sequences, first in the limit of ideal pulses, then when including pulse imperfections 
such as finite pulse duration and qubit rotation (flip-angle) errors. Although our optimization is numerical, we 
identify a deterministic structure underlies the top-performing sequences. We use this structure to devise DD 
sequences which outperform previously designed concatenated DD (CDD) and quadratic DD (QDD) sequences 
in the presence of pulse errors. We explain our fundings using time-dependent perturbation theory and provide 
a detailed scaling analysis of the optimal sequences. 



I. INTRODUCTION 

Quantum information processing (QIP) relies on the ability 
to implement high-fidelity quantum gate operations and suc- 
cessfully preserve quantum state coherence [1]. One of the 
most challenging obstacles for reliable QIP is overcoming the 
inevitable interaction between a quantum system and its envi- 
ronment or bath. Unwanted interactions result in decoherence 
processes that cause quantum states to deviate from a desired 
evolution, consequently leading to computational errors and 
loss of coherence. In order for QIP to be realizable in the set- 
ting of open quantum systems, it is necessary to address the 
detrimental effects of decoherence. 

Dynamical decoupling (DD) is one such method, which 
seeks to attenuate the effects of decoherence by applying 
strong and expeditious control pulses solely to the system 
[2-7]. Provided the pulses are applied over a time duration 
sufficiently shorter than the correlation time associated with 
the environment dynamics, DD effectively averages out unde- 
sirable interactions and preserves quantum states with a low 
probability of error, or fidelity loss. One advantage of DD 
over quantum error correction (QEC) is that it is an open- 
loop technique, i.e., does not require feedback or measure- 
ment. Furthermore, DD has been widely experimentally stud- 
ied in a number of systems, including ion traps [8-10], nuclear 
magnetic resonance (NMR) [11-13], solid state quantum dots 
[14], and nitrogen vacancy (NV) centers in diamond [15-17]. 

The earliest known DD sequence constructions built upon 
the Hahn spin echo effect [18], by applying a sequence of 
control pulses implementing tt -rotations, each separated by 
a fixed time duration. Two particularly notable sequences 
include CPMG [19], which utilizes cycles of two identical 
pulses to preserve spin magnetization along a single direction 
(useful for known quantum state preservation), and XY4 [20], 
a four pulse multi-axis DD sequence that can increase coher- 
ence times isotropically (useful for unknown quantum state 
preservation). Both sequences were eventually extended to 



an "XY-family" which incorporates longer pulse sequences 
to improve coherence even further and provide robustness 
against errors generated by experimental imperfections in the 
control pulses [21]. 

In the context of open quantum systems of interest to us 
here, the key feature responsible for an increase in coherence 
time for CPMG and XY4 is the suppression of the first or- 
der term in time-dependent perturbation theory for a system 
weakly coupled to a bath and subjected to dephasing or gen- 
eral decoherence, respectively. Concatenated DD (CDD) is 
a deterministic sequence design that exploits this property by 
recursively embedding any base DD sequence (e.g., XY4) into 
itself to successively suppress an additional order of the per- 
turbation expansion at each level of concatenation [22]. CDD 
has been extensively studied analytically [23-25], numeri- 
cally [26-28], and experimentally [14, 29-33], for a variety 
of systems, and its predicted ability to achieve high order sup- 
pression has been largely confirmed. 

A CDD sequence of order q using a base sequence of K 
pulses uses equally spaced pulses to suppress the first q or- 
ders of the perturbation expansion in the ideal pulses limit for 
a single qubit system subjected to general decoherence (e.g., 

= 4 for XY4)[22, 23]. What distinguishes an XY4 -based 
CDD sequence of order q from, e.g., a periodically repeated 
XY4 sequence (PDD) with the same total number of pulses, 
is just the pulse order. Yet, (unsymmetrized) PDD achieves 
at most first order suppression, in contrast to qth order sup- 
pression for CDD. This begs the question of whether there 
exist other sequences that achieve even better performance 
than CDD, arrived at merely by optimizing the pulse order and 
type. This is the main question we address in this work, using 
numerical optimization based on genetic algorithms, supple- 
mented with simulated annealing. 

Notwithstanding recent progress in pulse sequence opti- 
mization using unequal intervals, in particular the UDD se- 
quence and its generalizations [34-44], we focus our opti- 
mization on the case of equal pulse-intervals, as this greatly 
simplifies the optimization, and is furthermore often the most 
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convenient experimental situation. Our first main result is to 
show that numerically optimal sequences comprising ideal, 
zero- width 7r-pulses can be located which exhibit the same er- 
ror suppression characteristics as XY4 -based CDD while us- 
ing times fewer pulses, where r is the concatenation level. 
Furthermore, we identify in these sequences a deterministic 
structure which we use to construct additional, robust high- 
order sequences. 

As the techniques of DD sequence construction have be- 
come increasingly sophisticated, the issue of robustness to 
control pulse imperfections has remained one of the promi- 
nent restrictions of sequence performance in experimental set- 
tings. Systematic errors such as rotation- angle or rotation- 
axis errors, and finite pulse duration errors brought about by 
bandwidth constraints, can generate additional decoherence 
that quickly destroys the decoupling efficiency of all known 
DD schemes. Robustness to such errors has been addressed 
by the XY-family of sequences and CDD, and in a system- 
atic manner — for pulse-width errors — by Eulerian DD (EDD) 
[45] and its generalization to logic gates, known as dynam- 
ically corrected gates (DCG) [46, 47]. A concatenated ver- 
sion of DCG (CDCG) has been shown to be capable in prin- 
ciple of achieving arbitrarily accurate gates using finite width 
pulses [48]. Protocols based on pulse-interval optimization 
have been shown experimentally to be highly sensitive to 
pulse imperfections, thus forfeiting their ideal-pulse decou- 
pling efficiencies [49]. Certain numerical optimization tech- 
niques such as locally optimized DD (LODD) [50], band- 
width adapted DD (BADD) [51], optimized noise filtration 
DD (OFDD) [52], and Walsh function DD (WDD) [53], ex- 
hibit a degree of robustness to finite pulse duration, however, 
the relationship between sequence performance and rotation 
errors is unclear. A more recent approach for combating pulse 
imperfections, known as Knill DD (KDD), utilizes a sequence 
of variable angle pulses separated by fixed pulse-intervals to 
generate an effective sequence of two 7r-pulses [54]. In con- 
trast to the XY-family and XY4-based CDD, KDD exhibits 
robustness to finite- width and flip-angle errors. One impor- 
tant question which we seek to address here is whether it is 
possible to obtain a similar robustness to both forms of pulse 
imperfections utilizing only 7r-pulses. 

To account for such errors, our numerical optimization is 
extended to include pulse imperfections, in particular finite 
pulse width, flip-angle errors, or both. We show that robust 
DD sequences exist which perform considerably better than 
the original CDD sequence and sequences based on unequal 
pulse-intervals, such as UDD and its variants [34-44]. How- 
ever, sequence performance eventually saturates with growing 
sequence length. Interestingly, we find that the deterministic 
structure identified in the ideal pulse limit provides robustness 
against both forms of pulse errors. 

The structure of this paper is as follows. In Section II, 
we supply background information and formal mathematical 
specifications for DD effectiveness in the context of general 
open quantum system dynamics. The particular error model 
utilized for this study is then discussed along with the details 
regarding the control Hamiltonian, which is responsible for 
the DD control fields. In Section III, we discuss how to apply 



Genetic Algorithms (GAs) to DD optimization. The results 
of our optimal sequence search are then given in Section IV, 
where we identify sequences that obtain a higher degree of DD 
efficiency than standard CDD sequences under the condition 
of ideal (5-function pulses, and exhibit robustness to certain 
forms of pulse imperfections; namely, over and under-rotation 
errors. In Section V, the search results are supplemented with 
a comparison between the GA optimal sequences, CDD, and 
QDD for each pulse specification defined in Sec. II. Optimiza- 
tion proves to be most notably beneficial in the case of pulse 
imperfections where the GA sequences convey their superi- 
ority over known deterministic sequences. In Sec. VI, using 
the results of Sec. IV and V, we discuss the existence of a 
concatenation-based deterministic scheme built from optimal 
sequences that offers robustness to both finite- width and rota- 
tion errors. We present our conclusions in Section VII, while 
additional information regarding DD performance scaling is 
given in Appendices A and B. 

II. BACKGROUND, PROBLEM SETUP, AND TOOLS 

In this section, we briefly review the pertinent basic back- 
ground and mathematical framework for DD. The general er- 
ror model is then introduced and its numerical implementation 
specified. The DD control Hamiltonian is discussed and spec- 
ified for each of the four pulse conditions employed in this 
work, differing by the degree and nature of the pulse imper- 
fections. Finally, we introduce a distance measure with which 
DD performance is quantified throughout the paper. 

A. Dynamical Decoupling 

Consider an open quantum system described by the Hamil- 
tonian 

H{t) = Ho^Hc(t). (1) 

The time-independent term Hq governs the internal dynamics 
of the system and environment, while Hc{t) is responsible for 
the time-dependent DD control fields. The Hamiltonian Hq is 
resolved further into 

Ho = Hs^Ib^Is^Hb^ Hsb. (2) 

where Hs is the pure system Hamiltonian, Hb is the 
pure environment Hamiltonian, Hsb represents the system- 
environment interaction, and Is{b) is the identity operator on 
the system (bath). 

For brevity, we denote 

i/err = Hs®Ib^ Hsb (3) 

as the error Hamiltonian, where the pure system and system- 
environment interaction Hamiltonians constitute the sources 
of undesired system evolution and decoherence, respectively. 
Removal of undesired system evolution is particularly rele- 
vant when DD is utilized for high fidelity quantum memory 



3 



storage [2, 23, 34] or in a "decouple-then-compute" approach 
to quantum gate construction [25, 55, 56]. Storing quantum 
memory requires initial state preservation, hence the desired 
system evolution is trivial action on the system. Any form 
of system dynamics present after the DD evolution would al- 
ter the initial state, resulting in storage errors. In a similar 
manner, gate errors can be acquired during the application 
of a nontrivial quantum gate if undesired system dynamics 
remain upon the completion of the gate operation. Unde- 
sired system evolution must also be removed in alternative 
DD-protected gate construction strategies, such as "decouple- 
while-compute" [55-57], in particular to prevent leakage er- 
rors when a decoherence-free subspace (DFS) or stabilizer 
code is used to enable computation while DD pulses are ap- 
plied [58,59]. 

We assume that the control Hamiltonian Hc{t) imple- 
ments the DD pulses solely on the system. Hc{t) conse- 
quently acts trivially on the pure environment Hamiltonian, 
[Hc{t)^ Hb] = Vt, and nontrivially on JYerr. The manner 
in which Hc{t) operates on i^err ultimately determines the 
effectiveness of DD in suppressing the contributions of the er- 
ror Hamiltonian to the system evolution. Demanding that each 
pulse operator anticommute with at least one term comprising 
i^err, the systcm is driven in such a way that i^err can be effec- 
tively averaged out for sufficiently short time durations. How 
short can be elucidated in the interaction ("toggling") picture 
with respect to Hc{t) [3, 55, 60, 61], where 



Ho{t) 



Ul{t)HoUc{t) 



(4a) 
(4b) 



with the control unitary 



Uc{t) 



rexp(^-i^ Hc{t)d?j, (5) 



where T denotes the time ordering operator. The unitary time 
evolution operator Uo{t) satisfies the Schrodinger equation 

i^^Uoit) = Homoit), Uo{0) = I (6) 

and Uo{t) = Ulj{t)U{t), where U{t) is the time evolution 
operator generated by Eq. (1). 

Employing time-dependent perturbation theory (TDPT) via 
the Magnus expansion [62, 63] to solve Eq. (6) we can write 



(7) 



with the anti-Hermitian operator Vt^'^\Tc) representing the 
nth term in the Magnus operator expansion after a total DD 
cycle time Tc. The leading terms of the expansion are 



I I Ho{ti)dti, 
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(8) 



, (9) 



while the higher order Magnus terms are constructed recur- 
sively as a sum of (n — l)-fold commutators. A sufficient 
condition for convergence of the Magnus expansion is [64] 
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\\Homdt<Tr. 



(10) 



In agreement with average Hamiltonian theory (AHT) [60, 
65], the time-dependent evolution generated by ^o(^) is for- 
mally identical to a time-independent evolution generated by 
the effective Hamiltonian 



Hb{tc)^H,,,{tc). 



(11) 



AHT applies here since we are only interested in the joint 
system-environment dynamics at the end of each stroboscopic 
DD period of Tc. We can further partition Hq into an effective 
pure environment term and a sum of effective error Hamilto- 
nians 



^e.(Tc)^-£0(»)(r,), 



(12) 



where Q^i^ {tc) is the nth order Magnus operator contain- 
ing non-trivial system operators, while Hb{tc) contains only 
terms with trivial action on the system. 

In light of Eq. (12) we find that DD facilitates an effective 
suppression of i^err by suppressing Hqyy{tc) up to some or- 
der in the Magnus expansion. When the first N terms of the 
expansion of Hq^{tc) vanish we speak of 'Wth order decou- 
pling." Assuming A^th order decoupling has been achieved, 
the toggling frame evolution is given by 



^ p-ire^B(re) + 0[(||^e'rrlh 



(13a) 
(13b) 

where the evolution is predominately dictated by the effective 
pure environment Hamiltonian Hb{tc) when ||5^^j^||rc <C tt 
and A- > 1. Here, H',,, = ^ E^^iv+i ^eS^(^c) denotes 
the remaining effective error Hamiltonian and || A|| is the sup- 
operator norm of A (largest singular value): 



sup 



1^) II W II ' 



(14) 



Thus the effectiveness of DD is dependent upon intrinsic 
properties, in particular the strength of the interaction and pure 
environment Hamiltonians. In situations where the internal 
dynamics are sufficiently fast, a short DD cycle time is de- 
sirable to maintain Eq. (10). Furthermore, it is also desirable 
to achieve high order error suppression, A^ 1, so that the 
effects of Hq^^ are less consequential. We make use of both 
conditions to analyze DD in the presence of various strengths 
of internal dynamics, while determining the minimum num- 
ber of control pulses required to obtain a given order of error 
suppression. 

As we shall see when we present the analysis of optimal 
sequences, starting in Sec. IV, the effective error Hamilto- 
nian (12) has significant explanatory power. 
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B. Error Model 

The system of interest is a single-qubit system generically 
coupled to its environment. The internal dynamics are gov- 
erned by 

^e{I,x,y,z} 

where and are the spin- 1/2 Pauli matrices and gen- 
eral bounded environment operators, respectively. Selecting a 
four-qubit spin bath to model the environment for the numer- 
ical search, the environment operators are given by 

B, = T.T.<p{<^^')' ^^^^ 

i^j a, (3 

where z, j index the bath qubits, a^P^ji G {/, and 
c^^ G [0, 1] are random coefficients chosen from a uniform 
probability distribution. The construction of permits at 
most two-body interactions between the environment qubits 
and three-body interactions between the system and environ- 
ment. Note that Eq. (16) contains terms proportional to the 
identity operator erf crj^, which account for the pure system 
Hamiltonian described in Eqs. (2) and (3). 

C. Control Hamiltonian 

The general form of a single qubit control Hamiltonian is 
Hc{t) = \ (17) 

jj.e{x,y,z} 

where V^{t) is the control field associated with the cr^ de- 
gree of freedom. All of the essential information regarding 
the DD sequence is contained within V^{t), i.e., pulse timings 
and amplitude profiles. In general, varying either quantity can 
result in drastically different optimal sequence constructions. 
Considering equal pulse-interval delay times of Td throughout 
the DD evolution, we examine how optimal sequence con- 
struction varies with amplitude profile. We consider the most 
customary pulse profiles: zero- width and rectangular, finite- 
width, and pursue an analysis of each profile with the addition 
of qubit rotation errors to model the existence of systematic 
errors brought about by faulty control fields. 

1. Ideal pulses 

The first type of pulse considered is an idealized, zero- 
width control field 

v^(t) = Y^(^oKt-t';), (18) 

3 

where the Dirac delta function 5{t) constitutes the pulse pro- 
file. The pulses are applied at times and the angle of rota- 
tion is given by (j)^. The control fields are restricted so that 



they act uni-axially for all time t. In terms of the single- 
qubit Bloch sphere, the condition can be visualized as allow- 
ing pulses solely along one of the three axes. We impose this 
constraint on all subsequent definitions of (t) as well. 

2. Finite-width pulses 

Since zero-width pulses are experimentally impossible, we 
relax the ideal pulse assumption and consider pulses of finite 
duration as well. We model the finite duration by 

v,{t) = J2a [e{t - 1^) - e{t - f; - r,)] , m 

3 

representing a piecewise continuous control field with a rect- 
angular profile [2] . The pulse amplitude is denoted by A and 
the pulse duration is r^, so that Arp = The Heaviside 
Theta function, 0(t), dictates the pulse profile where it is as- 
sumed that the time to turn the pulse "on" and "off is neg- 
ligible, therefore, the pulse is well approximated by a square 
wave. 



3. Flip-angle errors 

An additional form of systematic error we consider is that 
of an over- or under-rotation in the angle (/)o, commonly re- 
ferred to as a flip-angle error [60]. This particular type of 
error is relevant, e.g., in nuclear magnetic resonance (NMR), 
where inhomogeneity of the control field across the sample 
results in qubit rotation errors [12]. Flip-angle errors are also 
prevalent in other systems such as donor electron spins in Si 
systems [66, 67]. 

In the case of zero- width pulses, the control field takes the 
form 

^MW = E^o(i±6)j(t-t;), (20) 

3 

where e denotes the error in the rotation angle and the +(— ) 
refers to an over-(under-)rotation. By modeling the control 
field in this manner, it is assumed that the pulses are applied 
along their respective axes with zero or negligible error. The 
inclusion of rotation-axis errors has been previously studied 
for some common deterministic DD schemes [68], but is not 
included in our present study. 

4. Finite-width flip-angle errors 

As a worst case scenario we also consider the combined 
effect of flip-angle errors for finite- width pulses. Assuming 
that the error in the pulse duration is negligible, a flip-angle 
error can be thought of as an error in the pulse amplitude. We 
model the combined error control field by 

V,{t) =Y.A{1± e) [e{t - - e{t - - T,)] (21) 

3 



5 



and note that this particular form is one of the most prevalent 
pulse profiles encountered in experimental settings [13, 49, 
54, 66, 68]. 

D. Distance Measure and Scaling 

Rather than the standard Uhlmann fidelity or trace-norm 
distance [1] we use a state-independent distance measure, 
which significantly reduces the computational overhead. 
Namely, we quantify DD performance using 

D{U, G) = J— min \\U - G ^ (22) 

where U represents the full evolution operator of the sequence 
[i.e. U satisfies the Schrodinger equation (6) with the Hamil- 
tonian (1)] , G is the desired evolution of the system, the 
norm is the Frobenius norm \\X\\f = VTr(XtX), and ds 
and dB are the dimensions of the system and the environment 
Hilbert spaces Hs and Hb, respectively [69]. The minimiza- 
tion problem can be solved analytically to obtain the closed 
form expression [70] : 

where ||r||Tr = Tr(Vrtr) represents the trace-norm and 

r = Tr s[U {G^ 0Ib)], (24) 

where Tr^ denotes a partial trace over the system degrees of 
freedom. In the subsequent analysis, G = Is for the desired 
DD evolution and we denote D = D{U^I). 

We characterize optimal sequence performance with re- 
spect to two parameters associated with the internal dynamics: 
the "strength" of the error Hamiltonian and pure environment 
dynamics given by 

J=\\H,^l P=\\HbI (25) 

respectively, and three parameters whose relevance depends 
on the control Hamiltonian specifications: the pulse-interval 
Td, the pulse duration r^, and the rotation angle error e. Each 
of these parameters can be utilized to extract the scaling of the 
dominant term in the effective Hamiltonian by analyzing D as 
a function of the parameter of interest. 

In the case of ideal pulses, the distance can be shown to be 
upper-bounded as 

D<0[{J + pf+'T^+^], (26) 

where N is the order of error suppression. See Appendix A for 
a proof. Analyzing I) as a function of Td, J, and /3, the order 
of error suppression and essentially the structure of the dom- 
inant effective error Hamiltonian operator can be determined 
for the relevant situations where the dynamics are dominated 
by system-environment interactions {J ^ P) or the bath dy- 
namics (J <^ /3). Similar studies can be performed for finite- 
width or flip-angle errors as well. We ultimately utilize this 
method in conjunction with AHT to characterize each optimal 
sequence and determine robustness to various pulse errors. 



III. ALGORITHM 

Genetic Algorithms represent an approach to optimization 
problems based on the properties of natural evolution. Given 
an initial population and a definition of fitness, the algorithm 
simulates the processes of selection, reproduction, and mu- 
tation in an attempt to locate the member in the population 
with the highest probability of survival. In regards to DD, the 
population can be thought of as a subset of all possible se- 
quence configurations, where a configuration is specified by 
the order and types of pulses, for a given sequence length K. 
The member with the highest probability of survival is the se- 
quence which maximally suppresses system-bath interactions 
with respect to a particular distance measure. In the following 
sub-sections we outline the representation of the population 
and discuss how selection, reproduction, and mutations are 
implemented in the setting of DD optimization. 



A. Sequence construction 

The search is performed in the "lab" reference frame de- 
fined by the Hamiltonian H{t) of Eq. (1). Due to the piece- 
wise continuous form of Hc{t), the time evolution operator 
U (tc) can be decomposed into a series of unitary operators. 
Demanding fixed pulse-intervals, a sequence of K pulses ap- 
plied Sittj = jTd, j = 1, 2, . . . , iC, yields the time evolution 
operator 

U{Te) = Pk fr, Pk-1 /r. • • • ^2 fr, A fr, , (27) 

where Pj is the unitary evolution operator achieved by the jth 
pulse and fr^ = e~^^^^'^ designates the "free evolution" prop- 
agator between successive pulses. The pulse operators are de- 
fined such that Pj e Q, where Q denotes a discrete set of al- 
lowable control pulses that depends upon the choice of V^{t). 
The total sequence time Tc is also dictated by the choice of 
Vjj^{t) since the finite duration of the pulse contributes when 
applicable, e.g., for a sequence of rrid pulse delays and rrip 
nontrivial pulses Tc = mdTd-\-mpTp. While Eq. (27) is not the 
most general DD evolution operator for fixed pulse-intervals, 
since it does not permit consecutive pulses without free evo- 
lution periods, it still captures a majority of the known se- 
quences and additional highly robust sequence constructions. 



B. Chromosome structure 

The canonical approach to GAs is to define a member of 
the population by a set of genes, loosely referred to as a chro- 
mosome. Each gene can be thought of as a parameter in the 
optimization problem which contributes in some way to the 
fitness, and therefore the probability of selection, of the mem- 
ber. Defining a member in the population as a DD sequence, 
Eq. (27) is translated directly into its corresponding chromo- 
some 

Cf^ = {Pi,P2,...,Pk-i,Pk}, (28) 
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representing the jth member in the ath generation. The genes 
are given by the pulses in the sequence, therefore the number 
of genes increases with increasing sequence length. In gen- 
eral, a sequence and its corresponding chromosome do not 
have to be structurally equivalent. Later we will elaborate on 
why the naive translation of Eq. (28) is not favorable for DD 
optimization and discuss how it can be refined; however, for 
now Eq. (28) is adequate to describe each aspect of the algo- 
rithm outlined in the subsequent sub-sections. 

The population is given by the set of chromosomes 
where each cj"^ corresponds to a sequence 

Uj^\rc) and Q is the population size. The total number of 
possible sequence configurations, JV{K), is determined by 
both the length of the sequence and the number of pulse types 
in Q. The size of the sequence space grows exponentially with 
the length of the sequence, M{K) = \G\^ , where |^| is the 
number elements in Q. 

The search space can be greatly reduced by imposing the 
cyclic DD condition Uc{tc) = Is^ which is applicable for 
our focus on quantum memory preservation. The condition 
can be recast in the context of the search problem as 

K 

J]pf^i^/5 (29) 

i=i 

on all Cj'^\ where only the ideal, zero- width version of the 
pulse is used when finite- width or flip-angle error pulse pro- 
files define V^(t). Applying Eq. (29), the search space is re- 
duced to A/}? (i^) = where only \g\-^ of the original 
search space accounts for viable DD sequences. 

The initial population is chosen at random from the reduced 
search space, such that Q <C Mr{K). In general, the size of Q 
is somewhat arbitrary and expected to vary depending on the 
number of degrees of freedom specified by the problem. In the 
context of DD optimization, the size of the initial population 
will utlimately end up fixed for all sequence lengths due to 
the structure of the initial chromosomes; see Section IIIFl 
for additional details. 



C. Selection 

Associated with each chromosome cj"^ is a selection prob- 
ability p^^^ . This quantity defines the probability of being se- 
lected for reproduction in generation a and is given by 



where q^^^=— log^o D^j^^ represents the performance, or fit- 
ness, of the jth sequence. Here, we impose the cyclic DD con- 
dition as wen, G = Is, and denote of^ = Df\u{Tc), Is)- 
The logarithm is included in the definition of the fitness due to 
complications with the selection probability that are attributed 
to the extreme sensitivity of Eq. (22), and any distance mea- 
sure for that matter, to sequence variations. The exchange of 



a single pulse in a sequence with any other member of the de- 
coupling set can result in a change in performance up to many 
orders of magnitude. Since the reduced search space does not 
dismiss all poor-performing sequences, the fitness can range 
greatly in any generation. As a result, there is a reduced con- 
tribution of high performance sequences in the selection prob- 
ability distribution. The logarithm counteracts this issue by 
increasing the resolution of the selection probability. 



D. Crossover 

In each generation 2Q offspring are produced from the cur- 
rent population. Members of the population are chosen for re- 
production based on their probability of selection. The selec- 
tion process is constrained such that the crossover procedure 
only occurs between two distinct members of the population. 
Members with a high probability of selection not only pos- 
sess a higher likelihood of reproduction, but also have a higher 
probability of reproducing with multiple members in a single 
generation since each crossover is an independent event. 

Reproduction is implemented by a crossover between two 
members in the population, yielding two offspring. To best 
illustrate the crossover, consider the two chromosomes 

cj") = {Pi,...,Pi,...,Pfe}, (31) 

Cj? = . . . , . . . , (32) 

where Pi^Ri G Q. The offspring are created by splicing the 
parent chromosomes at a location chosen at random, where 
each pulse location has an equal probability of being chosen. 
Taking the splice point to be the ith pulse site, the resulting 
offspring are 

cj"^ = m,...,p^,i^^+i,...,i^fe}, (33) 

Cj"^ = {i^i,...,i^^,P^+i,...,Pfe}. (34) 

It is essential that the offspring still satisfy Eq. (29), how- 
ever it is not necessarily true that each is guarenteed to do 
so. If the DD condition is not satisfied, the pulse located at 
the splice point is manipulated until the condition is satisfied. 
For example, if cj"^ does not fulfill the DD condition then it 
is transformed to 

^ |p^^ ^ Ri^i, . . . , Rk}, (35) 

where it now is in agreement with Eq. (29) and e Q. In the 

situation that cj"^ cannot be found, the splice point is chosen 
again and the process is repeated until the proper offspring are 
created. 

The condition set forth by Eq. (29) restricts the crossover 
process and in some cases does not allow it at all. By per- 
mitting the manipulation of the pulse at the splice point, it is 
ensured that only the probability of selection dictates repro- 
duction. It is always possible to construct offspring from the 
above process, since there is no constraint on yielding off- 
spring which are identical to the parent chromosomes. Thus, 
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every set of parent chromosomes is guaranteed to produce 
some form of offspring. 

Upon producing the 2Q offspring, the best (5/4 parents and 
3(5/4 offspring are taken to be the new population. The par- 
titioning was chosen based on what appeared to be the most 
beneficial to the convergence of the algorithm. No duplicate 
sequences are allowed in the new population, however if the 
updated population size is less than Q then new members are 
generated at random from within the reduced search space. 



E. Mutation 

After reproduction, the new population composed of (5 /4 
parents and 3(5/4 offspring is used to create 2Q mutated se- 
quences, Q single- site and Q double- site. Every sequence in 
the population participates in both mutation processes, how- 
ever only a portion of the mutated sequences is retained for 
the succeeding generation. 

Single-site mutations are performed by choosing a pulse 
site at random and altering the pulse until Eq. (29) is again 
satisfied. If the DD condition is unsatisfiable then the origi- 
nal pulse is replaced and a different pulse site is chosen. It is 
possible that only the original configuration satisfies Eq. (29). 
In this situation the mutated member is simply a duplicate se- 
quence, therefore it is discarded. 

Double-site mutations correspond to linked single-site mu- 
tations. The process begins in a similar manner by choosing 
a pulse site at random, say the i\h site with pulse P^. An ad- 
ditional pulse site is now chosen at random from the set of 
pulse sites which have pulse types equivalent to P^, e.g., the 
jth site. Both Pi and Pj are updated simultaneously until the 
DD condition is again satisfied. If an additional pulse site does 
not exist, then the initial site is re- selected and the double- site 
mutation process is repeated. As in the case of the single-site 
mutation, if the DD condition cannot be satisfied then the mu- 
tated sequence is accepted as the original configuration and 
discarded. 

At the conclusion of the two mutations, a portion of the 
parent, offspring, and mutated sequences will comprise the 
new population. Only the sequences that have the highest fit- 
ness with respect to Eq. (22) are desired from each division of 
the population. We find that the best (5/8 parent, 5(5/8 off- 
spring, (5/8 single-site mutated, and (5/8 double- site mutated 
sequences comprise a favorable distribution for the new popu- 
lation. Other distributions were considered such as taking the 
best (5/4 of all mutated sequences, as well as different pro- 
portions of the offspring. However, no distribution appeared 
to yield a higher probability of optimal sequence convergence. 



F. Necessary Convergence Accelerators 

As noted above, single-site perturbations may result in 
large deviations in sequence performance. Hence, the log- 
arithm was introduced to decrease the performance gap be- 
tween poor- and well-performing sequences, thereby increas- 
ing the resolution of the selection probability. However, this 



— : Pi^^^^P^P^PqPjP^PqPi^P^^P^ 

= 1 • Pi ^ Ps P4 Ps P dPjo Pi 1 P12 P1 3P14 Pi5 P16 



— 2 : Pi^^^^APbPpPT^f^PoPioPiiPuPisPuPibPiG 



I = ^ '■ P1P2P3P4P5P6P7P8P9P10P11P12P13P14PL5P16 

/ = 4 : P1P2P3P4P5P6P7P8P9P10P11P12P13P14P15P16 

1 = 5: P1P2P3P4P5P6P7P8P9P10A1P12P13P14P15P16 

1 = 6: P1P2P3P4P5P6P7P8P9P10P11P12P13P14P15P16 

FIG. 1. Complexity-increase protocol for K = 16. The upper (blue) 
curves denote the linking between odd pulse sites and the lower (red) 
curves denote even pulse site linking. The process begins with all odd 
and even pulses linked, then continues by removing links between 
every other even pulse site. Links are removed until all even sites are 
uncorrelated, thereafter the odd sites undergo the same process. 



adjustment only proves to aid in optimal convergence for se- 
quences comprised of K < 16 pulses. This is evident from a 
simple comparison between CDD and numerically located se- 
quences at = 16, 64, 256, where the numerically "optimal" 
sequences perform far worse than CDD. 

We suspect that the local minima convergence is ultimately 
attributed to significant deviations in sequence performance 
that result in relatively large local minima traps. We alleviate 
this complication by introducing two convergence accelera- 
tors which act to reduce the size and presence of large traps, 
thereby smoothening what we refer to as the fitness landscape. 
Both accelerators are crucial for the algorithm to converge 
on global optima as the number of pulses increases beyond 
K = 16. 



1. Reducing Local Traps via Complexity 

Although the size of M{K) is decreased by imposing 
the cyclic DD condition, the resulting reduced search space, 
JVr{K), maintains its exponential scaling in the number of 
control pulses. Hence, there is still a high probability of the 
subspace containing low-performance sequences that lead to 
large local traps. This issue is resolved by reducing the search 
space further and systematically increasing its size as the algo- 
rithm iterates, such that the search space at the termination of 
the algorithm is Mr{K). The additional reduction is achieved 
by constraining the complexity of the chromosome, thereby 
moderating the possible sequence configurations. 

Initially, we choose each chromosome to represent the most 
elementary two-dimensional sequence 

cj'=°'"=°^ = {n„P.J, (36) 
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where /=0 is the initial complexity index. The notation Pg- 
denotes a pulse Pj applied at the locations specified by the 
set Sj. We choose si and S2 to contain only the odd and 
even pulse sites, respectively, for the initial population. This 
is a relevant construction since all known deterministic DD 
schemes utilizing fixed free intervals contain the same pulse 
at either every even or odd site [20, 23]. Moreover, it conve- 
niently reduces the space to only | ^ p sequence configurations 
for all K. 

For a general complexity index the chromosome is de- 
fined as 



where we require that 



K{1) 



all sites and 



K{1) 



(37) 



(38) 



be satisfied so that only one control pulse is applied at each 
pulse site. The number of sets {sj} is determined by 



§(/ + !) : /even 



(39) 



1) 



/odd 



for / = 0, 1, 2, ... , /max- At maximum complexity, /max, the 
most general sequence within JVr{K) is permitted. Hence, 
each Sj is a single element set containing only the jth pulse 
site. An example of the complexity-increase procedure is il- 
lustrated in Figure 1 for K = 16. Note that at each level 
of complexity-increase we have chosen to remove constraints 
only pertaining to odd or even sites. The constraint between 
every other even site is removed until each even pulse site 
is independent, afterwhich the same is performed on the odd 
sites. 

In contrast to Eq. (28), the number of elements in Eq. (37) 
increases as the algorithm iterates. It is important to note that 
this aspect does not imply an increase in the number of pulses, 
rather an increase in the permissible search space. This is 
an attractive feature since it not only diminishes the presence 
of local traps, but also yields an initial set of sequence con- 
figurations which only scales quadratically in \Q\. Choos- 
ing sequences for the initial population is obviously much 
more favorable here since the space is drastically smaller than 
Mr{K). In principle, it may even be possible to choose the 
entire set as the initial population. For a single-qubit system 
subjected to ideal 7r-pulses, we find that the complete initial 
set of configurations is indeed computationally convenient, 
consisting of only 16 possible configurations. Other pulse 
profiles lead to larger initial sets, but, remarkably, optimal se- 
quence convergence is possible for initial populations of only 
16 sequences. 



2. Fitness annealing 



of the chromosome only the probability of generating local 
traps is diminished. In order to control the relative differences 
between high- and low-performance sequences, we introduce 
an annealing process into the selection probability. Adopted 
from Ref. [71], the selection probability is redefined as 



[compare with Eq. (30)] such that 



/ (a) _ (a) ^ 
~(^a)/^7.^ / ^3 ^best 



9''"^(T) = exp 



T{a) 



(40) 



(41) 



The performance of the most fit member in the ath generation 
is denoted by q^^^ and the temperature function is given by 



T[a) = To 



-x/ c 



mir 

1 — T^sm a 

OLr 



(42) 



The temperature function utilized here is a modified version of 
the one introduced in Ref. [71], where we have included the 
sinusoidal function to reduce the probability of local minima 
convergence as T(a) decreases from the initial temperature 
To to the final temperature T/. The number of generations 
between these temperatures is dictated by the cutoff genera- 
tion OLc, which is chosen based on the value of K. For large 
K, we pick otc to be large as well since the annealing process 
is to accelerate global minimum convergence while reducing 
the probability of local minimum convergence. The remain- 
ing parameters r] and m are related to the amplitude and fre- 
quency of the oscillations, respectively. Upon increasing the 
complexity index ^, the annealing process resets with an ini- 
tial temperature To chosen so that all sequences in the current 
population have an equal likelihood of being chosen for repro- 
duction 



IV. OPTIMAL SEQUENCES 

In this section, we present numerically optimal tt -pulse se- 
quences obtained for the single-qubit system described by 
Eqs. (15) and (16). First, we consider the case of ideal zero- 
width pulses. We continue the analysis to include finite- width 
rectangular profiles, flip-angle errors, and finally the culmina- 
tion of both types of errors. For all pulse profiles the num- 
ber of pulses is varied from K = 1,2,..., 256 with a pulse- 
interval Td = O.lns.^ 

The value of K was varied over a significant range in our 
simulations, however, we found that only specific values of 
K are relevant for successive error suppression. In particu- 
lar, Kopt = 4, 8, 16, 32, 64, 256 correspond to the minimum 
number of pulses required to observe an increase in error sup- 
pression or significant improvement in performance. All of 



Substantial differences in sequence fitness manifest local 
traps in the fitness landscape. By decreasing the complexity 



^ Our choice of units is arbitrary but is meant to be commensurate with elec- 
tron spin qubits in e.g., quantum dots. 
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the remaining values of K result in a sequence performance 
upper bounded by the performance of the previous i^opt- For 
example, the optimal sequences for K = 17, 18, 31 ex- 
hibit a performance proportional to that of K = 16, if not 
worse. 

The values of i^opt were obtained by analyzing the scaling 
of the performance of the optimal sequences identified at each 
value of K in the ideal pulse limit. For K < 12, optimal se- 
quences were located by an exhaustive search, while K > 12 
demands the use of the GA algorithm discussed above. Upon 
locating the optimal sequences, the performance measure D is 
analyzed as a function of Td. The order of error suppression, 
A^, is then determined from Eq. (26) by 



N = \og,,{D)-l. 



(43) 



Values of K where N is found to increase ultimately corre- 
spond to those identified as values of ATopt. 

We find that the scaling method described above is a quite 
convenient numerical method for determining the structure of 
the dominant term in the effective error Hamiltonian for each 
of the optimal sequences obtained for a given ATopt- In order to 
fully characterize the scaling of Hq^y it is necessary to analyze 
distance measure as a function of each relevant parameter. 
In the case of ideal pulses, this would correspond to analyzing 
the performance as a function of { J, /3, Tc^} in the regimes of 
interaction-dominated dynamics (J {3) and environment- 
dominated dynamics ( J <C /3) since each regime may exhibit 
different scalings. When finite pulse duration and flip-angle 
errors are included the number of parameters increases to ei- 
ther a subset of { J, /3, r^i, r^, e} or the entire set if both forms 
of pulse errors are present. It is then necessary to analyze the 
scaling of D in each of the various parameter regimes in ad- 
dition to the interaction or environment-dominated regimes, 
e.g., pulse-width dominated (jp ^ Td), free evolution dom- 
inated (Tp <C Td), flip-angle error dominated (e ^ Jrd), 
etc. In the subsequent analysis, we make use of this tech- 
nique in conjunction with direct calculation of the effective 
error Hamiltonian to fully characterize sequence performance 
for each pulse profile. The effective Hamiltonian calculation 
is utilized to provide additional insight into the structure of 
Hqyt that can not be observed from the scaling method, most 
notably in situations where multiple sequences exhibit identi- 
cal performance scalings. 



A. Ideal pulses 

Here we examine the optimal sequence structure of ideal, 
zero-width pulses with respect to the strengths of the inter- 
nal dynamics, J and (3, in the regime where JTd^^Td G 
[10~^^, 10^]. Under the condition of uni-axial pulses, the set 
of possible control pulses Q = {/, X, F, Z}, where / is the 
identity operator and 



X{Y,Z) = -I (J 



x{y,z) 



(44) 



describe 7r-pulse unitary operators generated by Eq. (17) when 
Hc{t) is non-zero. Neither the error Hamiltonian, nor pure 



environment Hamiltonian, is present during the pulse evolu- 
tion since the effects of Hq are negligible in the limit of zero- 
width (infinite amplitude) pulses. 

Characteristics such as the dimension of the reduced search 
space N'r{K) are determined by the number of elements in 
the pulse set Q. Under the conditions of ideal ^-function 
pulses, Mr{K) = 4^~^ at / = /max- Consequently, the 
initial search space only contains 16 possible sequence con- 
figurations for all K using the complexity-increase procedure 
described in Sec. IIIF 1. All 16 are chosen to represent the 
initial population at the commencement of the algorithm and 
the size of the population is kept constant throughout. 



1. 4-Pulse Sequences 

The initial search is performed for = 4 to benchmark the 
algorithm. The expectation is that the well-known universal 
decoupling sequence [20] 



(45) 



should appear as the optimal sequence since it achieves first 
order error suppression for general single-qubit decoherence 
[3]. We confirm that XY4 is indeed the optimal sequence 
within the range of J, (3 given above, along with its obvious 
generalization 



GA4 :- P2fTdPlfTdP2fTdPlfTd^ 



(46) 



where Pi ^ P2 e {X,Y,Z}. 

The order of error suppression is confirmed by analyzing 
the scaling of Eq. (26) with respect to { J, /3, r^i}. We find 



D 



0{JpTl) : J</3 
0{J^tI) : J>/3 ' 



forXn, (47) 



These results are consistent with our expectation of first or- 
der error suppression and direct calculation of the effective 
error Hamiltonian, e.g., for GA4 = XY/^: 

^\rd(j' ([Po, B,] - i{B,, By}). (48) 

Additional results are shown in Fig. 2(a), where the perfor- 
mance of GA4 is shown for a wide range of J, (3 values. 



2. 8-Pulse Sequences 

Second order error suppression is achieved for K 
pulses using the generalized sequence structure 



GAs :=Ps{GA^)P3{GA^). 



(49) 



such that Pi,P2,^3 e and Pi ^ P2 is implied 

by the definition of GA4. Note that there is no free evolution 
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FIG. 2. (color online) Performance of GAr sequences for K — 4,8, 16, 32, 64, 256, as shown in (a)-(f), respectively, as a function of J and 
p. The minimum pulse-interval is fixed at Td = 0.1ns and the results are averaged over 10 realizations of B^. The GAr a sequences tend to 
be optimal in the range J < /3, while GAk b are optimal for J > /3. Sequences closely related to the deterministic structure of CDD, GAr c, 
begin to appear as optimal sequences for Jrd > 1 at = 64. The notation n x GA4 denotes the application of n cycles of GA4 and is 
dependent on the value of K. For the data presented above, n = 1, 8, 16, 64 for K = 4, 32, 64, 256, respectively. 



period between P3 and P2. In Figure 2(b), results for the op- 
timal sequence search are shown with respect to J and /3 for 
two versions of Eq. (49): 

GAsa := IP1P2P1IP1P2PU (50) 

when P3 = P2, and 

GAsb := (P3P2) P1P2P1 {P3P2) P1P2P1, (51) 

for P3 7^ P2 . We have dropped the free evolution periods for 
convenience of notation and highlighted the pulses which are 
not separated by free evolution periods with parentheses. 

The results shown in Figure 2(b) indicate that GAsa is gen- 
erally optimal when Jrd < 10~^ and J < (3, while GA^b is 
optimal for Jrd > 10~^ and J > /3. The partitioning of the 
space can be understood in terms of the scaling of the perfor- 
mance, where 

D - 0{J(3^rl) VJ, (3, for GAsb. (52b) 

We obtain this scaling numerically by analyzing the perfor- 
mance as a function of each of the relevant parameters; see 
Appendix B for details. As the strength of the error Hamilto- 
nian becomes large, Jrd ^ 1, it is clearly more beneficial to 
utilize GAsb since the effective error Hamiltonian scales lin- 
early in J; as opposed to the cubic scaling of GA^a- Note that 



the GAg sequences achieve an additional order of suppression 
relative to XI4, as can be seen by comparing the order of rd 
in Eqs. (47) and (52). 

Examining the sequence structure of GAga, a correspon- 
dence between this configuration and 

XYs = IXYXIXYX (53) 

is readily observed when Pi = X and P2 = Y. Known for 
providing second order error suppression for general single- 
qubit decoherence [21], XYg gains its decoupling attributes 
from its structure: the XY4 sequence followed by a time- 
reversed copy. An alternative perspective of XYs is given by 
GAs, where it appears as a concatenation of XY4 (GA4 = 
XY4) and CPMG (P3 = X) [19]. Depending on the choice 
of the CPMG pulses two different variations of GAs are ob- 
tained: GAsa, which is a generalized version of XYs, and 
GAsb- It turns out that concatenation is a key aspect of opti- 
mal sequence construction beyond GAs and will continue to 
appear for larger values of K as well. 

3. 16-Pulse Sequences 

We identify two optimal sequences for i^T = 16: G^iea 
and GAiQb' The former can be expressed in terms of GAsa as 

GAi^a := P3 {GAsa) P3 {GAsa) , (54) 
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such that P3 7^ /, while the latter sequence is given by 

: = P4 (GA4) Ps (GA4) P4 (GA4) P3 (GA4) (55a) 
= GA4GA4]. (55b) 

We introduce a useful notation in Eq. (55b) to represent the 
concatenated structure of Eq. (55a) in a condensed form. The 
pulse operators for both GAiQa and GAiq^ satisfy Pj G 
{X, F, Z} Vj. We point out that the sequences are not re- 
quired to contain the same pulse operators at each level of 
concatenation, therefore Pi ^ P3 and P2 7^ P4 in Eq. (55b) 
is also valid. 

The regions of optimal performance are depicted in Figure 
2(c) for both sequences. Similar to the case of K = 8, GAiQa 
is generally optimal for J < 10 ~^ and (3 > J, while the 
remaining portion of the parameter space, J > 10~^ and (3 < 
J, is mostly dominated by The equivalence between 

K = 8 and K = 16 in the J, /3 parameter space is most 
likely due to the fact that second order error suppression is still 
obtained, so the effective error Hamiltonians are quite similar. 
More specifically, the scaling of the performance for GAiQa 
and GAiQj) is consistent with GA^a and GA^b, respectively. 



D 



for GAiQa, 
for GA166. 



(56a) 
(56b) 



Even though an additional order of error suppression is not 
achieved at i^T = 16 relative to K = 8, we include the analysis 
to show a correspondence between known 16-pulse sequences 
such as XYiQ [21] and CDD2 (see Sec. V), and the GAiq se- 
quences. In the definition of Eq. (54) it was noted that P3 ^ I 
constitutes a generalized optimal sequence for K = 16. Inter- 
estingly, the discarded sequence corresponds exactly to XYiq. 
We conclude that XYiq is not within the range of optimal se- 
quences due to the fact that it does not reduce errors in any of 
the three qubit decoherence channels; errors actually accumu- 
late because it is simply two cycles of XYg. The second level 
of concatenation in CDD is found from GAiq^ when Pi = P3 
andPs =P4. 



4. 32-Pulse Sequences 

A minimum of = 32 pulses is required to achieve third 
order error suppression. The optimal sequence structures are 
described by 



GAs2a -GA^GAsa] 
GAs2b :=GA8a[GA^], 



(57) 
(58) 



where the difference between the two is the order of concate- 
nation. In Fig. 2(d), the regions of optimal performance for 
GAs2a and GAs2b indicate a clear distinction between the two 
sequences and, therefore, concatenation procedure. Optimal 
performance for GAs2a is well-characterized by Jrd < 10~^ 
and ^Td < 1. The latter sequence, GA326, dominates the re- 
gion where Jr^ > 10~^ V/3, and also appears to be optimal 
in portions of the parameter space where Jrd > 10 ~^ and 



13 < J. The remaining portion of the space, pTd > 1, is dom- 
inated by a sequence composed of 8 cycles of GA4, denoted 
by n X GA/i, with n = 8, Fig. 2(d). In this particular regime 
the environment dynamics are fast with respect to the time 
scale of the pulse delay. The cycle time reduces to effectively 
combat decoherence on the time scale of the environment dy- 
namics. 

The performances for the GA^2 sequences scale identically, 
following 



D 



0{jP'T^d) 
0{J^P'T^d) 



J>/3. 



(59) 



for Jrd < 1 and fird < 1. Differences in the performance of 
equal order sequences can be due to differences in the "per- 
formance amplitude," namely, the fact that the effective er- 
ror Hamiltonian can contain the same error terms with larger 
coefficients, hence reduced performance. Futhermore, higher 
order contributions can play a role in performance variations 
as j3Td 1 and Jrd 1. 



5. 64 -Pulse Sequences 

Optimal sequences located at K = 64 acquire fourth order 
error suppression via the generalized sequence structures 



GAQ4a •— GAsa[GAsa], 

GAQ4b •= GAsb[GAsb]^ 



(60) 
(61) 



each of which correspond to a first level concatenation of 
GAsa or GAsb, respectively. Additional combinations of 
concatenation, e.g., GAsa[GAsb] and GAsb[GAsa] turn out 
not to be optimal since only third order error suppression is 
achieved. The pulse operators satisfy Pj G {X, F, Z} Vj as 
in previous cases. 

Sequence performance for both GA^^a and GA^^b for 
Jrd <C 1 and /^r^ <C 1 scales as 



D 



I OiJ'P'rl) 



J<^f3 

J:>f3, 



(62) 



where GAQ^a is optimal in the region of J < /3 for /3rd < 1 
and GAQ4b is optimal for J > f3 for Jrd < 1. The distinc- 
tion between the regions of optimal performance, shown in 
Fig. 2(e), is due to a transition in performance at J = (3 
that results in GAQ4b becoming the preferred construction. 
We suspect that the effective Hamiltonian for GA^^a contains 
terms which result in a -scaling in D, in addition to the 
dominant J^ term, that diminishes performance once in the 
J > j3 regime. The GAQ^b construction does not receive a 
significant contribution from the quadratic term, if any at all; 
hence, the increase in performance. 

There are two additional sequences shown in Fig. 2(e). The 
n X GA4 sequence denotes the application of n = 16 cycles 
of GA4 to obtain the 64 pulses required for the search. This 
particular sequence construction is optimal for I3rd > 1 V J, 
as in the case of i^T = 32. The remaining sequence 



GAe^, = GA4GA4GA^]] 



(63) 
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obtains third order error suppression and exhibits a perfor- 
mance scaUng as 



D 



J^f3 
J>/3 



(64) 



The region of optimal performance is well-characterized by 
Jrd > 1 V/3, the high-frequency coupling regime. Note the 
correspondence between GAq^c and the third level of con- 
catenation in CDD. Similar to = 16, we find that CDD 
remains an optimal sequence construction in specific parame- 
ter regimes. 



6. 256-Pulse Sequences 

The final optimal sequence structures we have obtained are 
fox K = 256. All sequences are classified into three general 
structures denoted by GA256c- The GA236a 

and constructions achieve fifth order error suppres- 

sion by concatenating GAQ^a with GA4 as 



GA' 



256a 



GA^GAsalGAsa]] 

GA^GAe^a] 



and concatenating GAs2b with GAsb to obtain 
GA2,6b = GAsb[GAsa[GA^]] 

= GAsb[GAs2a]- 

The remaining sequence structure, 

GA2S6C = GA^GA^GA^GA^]] 



(65) 



(66) 



(67) 



attains fourth order error suppression and is essentially a gen- 
eralized version of the fourth level of concatenation of CDD. 
As in = 64, Pj e {X, r, Z} Vj. 

In examining the performance as a function of J and Td for 
GA256a, we find the scaling 



D 



J>/3, 



(68) 



and a similar scaling for the performance for GA2566 up to 
jTd > 10~^. The performance for GA256C scales as 



D 



J>/3 



(69) 



As found for K = 64, successive levels of GA^ concatena- 
tion, i.e., is predominantly optimal when Jrd > 1. 
Note that in the region where I3rd > 1, 64 x GA4 (64 cycles) 
remains the optimal sequence V J. 

The regions of optimal performance are depicted in 
Fig. 2(f). Two transitions in sequence performance are ob- 
served: one at J = /3, where GA2566 becomes the pre- 
ferred sequence structure, and a second at Jr = 10~^, where 
GA256a regains dominance and the performance of GA2566 
diminishing rapidly. Although the performance scales equiv- 
alently for GA2^Qa and GA2566, there is an obvious inherent 



Sequence 


J «.p 


J:>I3 


GA^ 






GAsa 

GAsb 




0(JV|) 


GAiQa 

GAiQb 




o{J\l) 


GAz2a 
GA32h 






GAe4a 
GAe4t, 

GAqHc 






G^256a 

GA2566 
GA2zec 







TABLE 1. Summary of distance measure (D) scalings for each op- 
timal GAk sequence identified in the ideal pulse limit, for a fixed 
pulse-interval of r^. 



weakness in GAs2a since further concatenation reduces the 
region of optimal performance; compare Figs. 2(d) and (f). 
The placement of GA4 in the concatenation is believed to ul- 
timately attribute to the attenuated performance of GA2566. 
We suspect that there will be an eventual critical level of con- 
catenation after which optimality no longer exists for concate- 
nated versions of GA32a due to the first order error suppres- 
sion of GA4. (This effect is similar to that seen for asymmet- 
ric QDD, where the performance is bounded by the order of 
error suppressing corresponding to the nested sequence with 
the smallest number of pulses [40].) 

We summarize the scaling of D for each of the optimal 
sequences discussed above for all i^opt in Table I. The op- 
timal sequences located here achieve an additional order of 
decoupling, on average, over the standard XI4 -based CDD 
sequences. For example, compare GAq^c and GA256C to the 
additional optimal sequences. 



7. Interlude: optimal CDD sequences 

Having found that the shortest sequences achieving order 
7V=1, 2, 3, 4, 5 decoupling have length = 4, 8, 32, 64, 256, 
respectively, we now consider what is the optimal sequence 
length for concatenation at fixed pulse-interval Td- A base 
DD (iiT, N) sequence of length K achieving Ni\\ order decou- 
pling uses pulses (and hence total time K^Td) to achieve 
order N x q decoupling at concatenation level q, a sequence 
we denote CDD q{K, N). 

Let us first compare K = 4, 8 at their lowest common 
equal decoupling order, i.e., at order 2. At this order the 
K = 4 sequence is CDD2(4, 1) and requires 4^ = 16 
pulses, and the i^T = 8 sequence is CDDi(8, 2) and requires 
8 pulses. Thus the K = 8 sequence is optimal: the compari- 
son at equal higher order decoupling levels 2^, ^ > 1, yields 
Iq2£ ^ pulses for CDD2£(4, 1), compared to 8^^ = 2^^ 
for CDD£(8,2). 
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Decoupling Order 



Number of Pulses 



GA3 



GAe 



1 


41 


= 22 














2 


42 


= 2* 


8^ 


= 2' 










3 


43 


= 26 






32 1 


= 2' 






4 


4" 


= 28 


8^ 


= 2« 






64^ 


= 2« 


5 


45 


= 2^° 














6 


46 


= 2^2 


8^ 


= 2« 


32^ 


= 2" 






8 


48 


= 2^^" 


8* 


= 2'^ 






64^ 


= 2^" 


10 


410 


= 2^° 


8^ 


= 2^' 










12 


412 


= 2^4 


8« 


= 2^^ 


32* 


= 2^0 


64^ 


^2i8 


15 


415 


= 2^0 






32^ 


= 2^5 






20 


420 


= 2*° 


glO 


= 2^° 






64^ 


= 2=^° 


24 


424 


= 2*8 


gl2 


= 2^« 


328 


= 2*° 


64*' 


= 2'"' 


30 


430 


= 2^0 


gl5 


= 2*5 


32^° 


= 2^0 






40 


440 


= 2«° 


g20 


= 2«° 






64" 


= 2"° 


45 


445 


= 2''° 






32^5 


= 2^5 






60 


460 


= 2120 


g30 


= 2^° 


32^0 


= 2"° 


64^5 


= 2''° 



256^ 2^ 



256^ 



256'' 



256^ 



256^ = 2^^ 
256^ = 2^^ 



256^ 



256' 



TABLE IL Summary of decoupling efficiencies for ideal-pulse concatenated sequences. The first column gives the achievable decoupling 
order, the next five columns give the number of pulses required by each of the GAr base sequences to achieve the desired decoupling order. 
The exponent q in each entry of the form is the concatenation level. The shortest sequence lengths needed for a given decoupling order are 
boxed. 



When we include DD(32,3) the lowest common decou- 
pling order is increased to 6. At this order the = 4 sequence 
is CDD6(4, 1) and requires 4^ = 2^^ pulses, the = 8 se- 
quence is CDD3(8, 2) and requires 8^ = 2^ pulses, and the 
K = 32 sequence is CDD2(32,3) and requires 32^ = 2^° 
pulses. Thus the K = 8 sequence is again optimal. 

When we include DD(64, 4) the lowest common decou- 
pling order is increased to 12. At this order the i^T = 4 se- 
quence is CDDi2(4, 1) and requires 4^^ = 2^^ pulses, the 
K = 8 sequence is CDD6(8,2) and requires 8^ = 2^^ 
pulses, the K = 32 sequences is CDD4(32,3) and requires 
32^ = 2^0 pulses, and the K = 64 sequence is CDD3(64, 4) 
and requires 64^ = 2^^ pulses. Thus the i^T = 8 and K = 64 
sequences are optimal. 

Finally, when we include DD(256, 5) the lowest common 
decoupling order is increased to 60. At this order the K = 
4 sequence is CDD6o(4, 1) and requires 4^^ = 2^^^ pulses, 
the = 8 sequence is CDD3o(8, 2) and requires 8^° = 2^° 
pulses, the K = 32 sequence is CDD2o(32,3) and requires 
3220 ^ 2^00 pulses, the = 64 sequence is CDDi5(64,4) 
and requires 64^^ = 2^^ pulses, and the K = 256 sequence 
is CDDi2(256, 5) and requires 256^^ = 2^^ pulses. Thus the 
K = 8 and K = 64 sequences are once again optimal. 

An exhaustive list of all lowest common decoupling 
orders generated from the base orders A/'g{1,2,3,4,5} is 
Mg{1, 2, 3, 4, 5, 6, 8, 10, 12, 15, 20, 24, 30, 40, 45, 60}. It is 
straightforward to check that whenever CDDg(8, 2) achieves 
one of these decoupling orders, i.e., whenever M is even, 
then CDDq(8,2) requires the smallest number of pulses in 



comparison to all other CDDq{K^ N) sequences capable of 
achieving the same decoupling order (i.e., with K dividing 
M). The same is true for CDDq(64,4), which requires the 
same number of pulses as CDDq(8, 2), except that it cannot 
be used when M G {10, 30} (not divisible by 4). The only 
exceptions to the superiority of CDDg(8, 2) [or CDDg(64, 4)] 
are, of course, the cases with M odd, namely M = 3 and 
M e {5, 15}, where the optimal sequences are, respectively, 
CDDg(32, 3) and CDDg(256, 5). A complete classification is 
presented in Table II. 

B. Finite-width Pulses 

We now extend the analysis to include errors due to finite- 
width rectangular pulses of duration r^. We note that in this 
case EDD is a known way to achieve first order pulse-width 
error suppression, with the added assumption that pulse shap- 
ing is possible [45]. Each optimal sequence construction is 
examined with respect to J/^ e [10"^^, 10^] for (3 = IkHz 
and Tp/Td e [10"^, 10^] for = 0.1ns. The set of allowable 
control pulses 

e = {/,x,y,z,x,?,z} (70) 

contains three additional pulses {X, F, 2} corresponding to 
180-degree phase flips of the non-trivial pulses {X, F, Z}, 
which are generated by taking A —A in Eq. (19). Note 
that in contrast to the ideal pulse case, the identity operator 
I = ^-^^p^o so that for a given K the cycle time is equivalent 
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for all sequences. The remaining unitary pulse operators are 
generated from Eq. (19) and given by 



By. A distinc- 



X{Y,Z) 



— g-irp(Acr 



x(y,z) 



+^o) 



(71) 



where the internal Hamiltonian Hq is now included due to the 
fact that the pulses are now finite in amplitude. This form can 
be reconciled with the ideal pulse operators given in Eq. (44) 
by considering infinite pulse amplitude (A ^ oo) and a pulse 
duration that is effectively zero (rp 0) so that \\Hc{t)\\rp = 
7r/2 is maintained and 1 1 i^o 1 1 ^ . 

The finite amplitude assumption essentially defines a pulse 
duration that differs from zero, so that Hq becomes a rel- 
evant contributor to the dynamics generated by the pulse. 
Of course we must still make the strong pulse assumption 
||^c(^)|| ^ 1 1^0 1 1 in order to reduce the additional errors 
generated by Hq during the pulse. We enforce this assump- 
tion in the following section and utilize it to calculate effective 
pulse dynamics where the contributions of Hq are essentially 
perturbations to Hc{t). The effective pulse operators are then 
used to calculate effective error Hamiltonians for each finite- 
width-robust GA sequence. 

The dimension of the reduced search space is increased by 
the additional pulses to Mr{K) = 7^~i at /max- Conse- 
quently, the initial search space contains 7^ = 49 configura- 
tions at all values of K. We find that not all 49 sequences are 
required to obtain optimal sequences. Surprisingly, an initial 
population of 16 sequences chosen at random from the 49 is 
sufficient to converge on global extrema for all K. 



1. 4-Pulse Sequences 

The optimal sequence structures for i^T = 4 is nearly equiv- 
alent to the ideal pulse case, differing by phase adjustments 
to effectively combat the additional error introduced by pulse 
width. The generalized structures are given by 



RGA^ —P2P1P2P1, 
RGA^> -P2P1P2P1. 



(72) 
(73) 



where we refer to the pulse error-optimized sequences as ro- 
bust GAk (RGAk)- In order to determine the order of error 
suppression associated with the pulse-width, we examine the 
scaling of the performance as a function of Tp. The perfor- 
mance is found to scale according to 



(9( J/3rJ) : J « /3, Tp « Td 

0{JTp) : J <^P,Tp^Td 

0{J^tI) : J » /3, Tp « Td 

[ 0{JTp) : p.Tp^Td 



(74) 



for both sequences, which is in agreement with direct calcula- 
tion of the effective error Hamiltonian for each sequence when 
{Pi,P2} = {X,r}: 



RGA4 



GA4. 



TTTc 

4r„ 



GA4 



(75) 
(76) 



where Tc 4.{Tp + Td) and B^-y B^ 
tion between the two sequences is apparent at first order in 
Tp, where the dominant error term for RGA/^, isolated along 
the cr^ error channel, is a difference between two bath opera- 
tors and the additional phase adjustment included in RGA4' 
generates a sum of errors along two error channels: and 
cr^. The former is preferable since the difference bath operator 
Bx — By can vanish, while the sum operator appearing in the 
expansion of RGA4/ cannot. For randomly selected bath oper- 
ators this translates into a smaller operator norm, on average. 
The resulting difference between the two sequences is indeed 
very visible in Fig. 3(a), which shows the regions of optimal 
performance. RGA4' is generally optimal for jTd ^ 1 and 
J /3; the remaining region is dominated by RGA4. 



2. 8-Pulse Sequences 

Three optimal sequences are identified for = 8 which 
possess similar characteristics to GAsa, GA^b, and a well- 
known 8-pulse decoupling sequence designed to combat 
finite- width pulses. Adjusting pulse phases to obtain ro- 
bustness against finite- width pulse errors, the generalized se- 
quence structures are 



RGA^a' 
RGAsb 
RGAsc 



IP1P2P1IP1P2P1 (77) 
p3{RGA^)Ps{RGA^), (78) 
P1P2P1P2P2P1P2P1 (79) 



The first sequence identified is a robust version of GAsa, dif- 
fering from the ideal pulse construction by the two inverted 
P2 pulses. The prime is used in anticipation of later results 
where RGA^a is introduced in Eq. (105) below. The RGAg^ 
sequence utilizes a concatenation between a robust version of 
generalized CPMG, PsfrdPsfrd^ encompass RGA4, while 
RGAsc denotes a generic version of the Eulerian DD (FDD) 
sequence derived using walks on Cay ley graphs [45]. 
RGAsa' performance scales according to 



0{JpTpTd) 

0{JPtI) 

0{J%Td) 

0{M 



J <^(3,Tp<^Td 
J <^P,Tp:^Td 

J:^ p,Tp<^Td 

J'>f3,Tp'>Td 



(80) 



which is dictated by the effective error Hamiltonian 



rjRGAg 



WTdTp 



cjyBl 



^TdTp 



(jy{Bx,By}^o{jpTdTp) 



TTTc TTTc 

(81) 

computed for the specific case of Pi = X and P2 = Y. First 
and second order error suppression in Tp and Td is achieved, 
respectively, yielding the quadratic term of 0{J'^TdTp) as the 
dominant decoherence generator when J (3 in the strong 
pulse regime (Tp <C r^^). As we will illustrate below, RGAsa' 
is the preferred sequence construction for a majority of the 
parameter regimes considered due to the fact that it provides 
considerable suppression of 0{J'^TdTp) terms over RGA^^ 
and RGA^c- 
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(b)K = 8 



logio(Tp/Trf) -6 

(c) K = 16 



logio(J/>8) 





10gio(Tp/T^) 

(d)K: 



logioCTp/rrf) -6 



32 



e 



RGA4 
,RGA4 



RGAs 

|rga« 



(e) K = 64 

IrGAsc 

nxRGAi6fl 



logio(Tp/r^) ■ 

(f)K = 256 



I 



IRGA16Z, 

|RGA32c 



lmxRGA64< 



FIG. 3. (color online) Performance of optimal RGAr sequences for = 4, 8, 16, 32, 64, 256 shown in (a)-(f), respectively, as a function 
of J//3 and Tp/rd when DD is subjected to finite pulse duration. The norm of the bath Hamiltonian is fixed at /3 = IkHz, while J//3 G 



[10 



^,10^]. The pulse-interval Td 



0.1ns and the pulse width is varied in the range Tp/rd G [10 , 10 ]. For a given K, the optimal 



sequence configuration is most sensitively dependent upon variations in J. Contrary to the ideal pulse case, concatenated structures composed 
of RGAsa' and RGAgc appear to be the most favorable, in particular for K > 16 where RGAiQa^ and RGAq^c repeatedly emerge as optimal 
sequences. Sequence performance saturates 3i K = 16, while robustness begins to diminish dX K — 256. n x RGAiq^ denotes n cycles 
of RGAiQy ; n = 1, 4, 16 for = 16, 64, 256 respectively. The notation is similar for m x RGAq^c, where m = 1, 4 cycles are used for 
K = 64, 256. 



For the second sequence, RGAsb, the performance is dic- 
tated by the distance measure scaling 

D^O{Jrp)yj,P,rp,rd, (82) 

where the elementary concatenation procedure appears to only 
benefit the suppression of terms solely proportional to the 
pulse-interval. Computing the effective error Hamiltonian for 
RGAsb for both possible sequence variations, P3 = Pi and 
P3 ^ Pi, we find 

H^rr%=x ^ — + cjyB,) (83a) 

TTTq 

<^p1=z « ^ + '^'{B. - 2By)] (83b) 

TTTq 

for the specific case of {Pi,P2} = {X, F}. Note that by 
definition P3 7^ P2 for RGAsb- Adhering to the scaling 
demonstrated numerically by Eq. (82), both sequence vari- 
ations convey a linear scaling in Tp that is accompanied by 
a sum of two decoherence-generating terms which are dis- 
tributed among two error channels. The primary distinction 
between Eqs. (83a) and (83b) is the difference bath operator 
Bx — 2By, which appears in the P3 = Z case. On average, 
we can expect this term to exhibit a larger operator norm and, 
therefore, a less desirable performance than the P3 = X case. 



Although such differences in sequence design can be expected 
to effect the structure of the effective error Hamiltonian, it is 
seemingly unclear how one would determine which config- 
uration is the most favorable a priori. In the discussions on 
flip-angle errors, we address this issue again and show that 
such adjustments may, in fact, result in very different distance 
measure scalings. 

The remaining sequence, RGAsc, attains first order error 
suppression in Tp by traversing the Cay ley graph T = r{S^Q), 
where S denotes the single-qubit Pauli group with elements 
denoting the graph vertices and Q = {/, X, F, Z} is the gen- 
erating set comprising the edges. The original construction 
(captured by RGA^c) utilized two closed Eulerian cycles on F 
such that the second is completed by returning along the first 
path. However, additional paths exist which do not require 
closed cycles to obtain first order suppression in Tp. One such 
case is RGA^a', where the initial path and its inversion are 
both open Eulerian paths, as illustrated in Fig. 4. In compar- 
ing RGAsa' and RGA^c, the most important aspect appears 
to be the manner in which the paths are traversed rather than 
their closure. Note that variations in pulse phases may aid in 
the error suppression process, but are not necessarily required 
to obtain first order decoupling in Tp as GA^a also performs 
the task. 
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In terms of performance scaling, RGAgc results in 



0(J/3r|) 
0(J/?t2) 
0(JV|) 
0(j2r|) 



J < /?, Tp > Td 
J > /?, Tp < Td 
J > /?, Tp > Td 



(84) 



where first order decoupling in Tp and is clearly indicated. 
Note that second order error suppression in is not obtained 
for RGAgc, hence the advantage of RGAga' in the strong 
pulse regime. As Tp — > when J » /3 the effective error 
Hamiltonian for RGAgc is essentially given by 



RGAs 



■KTn 



.^J^„y(^Bl + {B,,By}) + 0{j\l) (85) 

Note the larger number of effective system-bath interaction 
terms present for RGAsc as compared to the effective error 
Hamiltonian of RGA^a' described in Eq. (81). A similar anal- 
ysis can also be done in the Tp ^ Td regime to further convey 
the superiority of RGA^a' over RGAsb and RGA^c- 

In Fig. 3(b), we support the above results for RGA^a', 
RGAsb and RGA^c with a numerical analysis over a wide 
range of J and Tp. As suspected from the performance 
scalings [Eqs. (80)-(84)] and effective error Hamiltonians 
[Eqs. (81)-(85)], RGA^a' is the preferred sequence for a ma- 
jority of the parameter space. RGA^b and RGA^c emerge as 
the optimal sequence configurations as J /3, with RGAgb 
dominating the Tp <C Td regime and the first order suppression 
of RGAsc in Tp becoming more favorable when Tp^ Td- 




IXYXIXYX 



FIG. 4. (color online) Pictorial depiction for the action of RGA^a' 
as an Eulerian path along the Cay ley graph V{S^Q) with vertices 
S = {/, X, Y, Z} and generating set G = {/, X, Y, Z}. Note that 
unlike the FDD construction [Fq. (79)], RGA^a' is generated by 
Eulerian paths, rather than cycles. 



RGAiQb' dominates the region J < j3. li is not possible to 
deduce an explanation for the partitioning of the space from 
the numerical scaling method since both sequences exhibit the 
same distance measure scaling 



D { 



0{JpTpTd) 

0{JPtI) 

0{JpTpTd) 

OiJ'r^p) 



J <^l3,Tp<^Td 
J<^(3,Tp:^Td 

J:>(3,Tp<^Td 

P.Tp^Td 



(88) 



3. 16- Pulse Sequences 

Robust sequences at K = 16 can be expressed in terms of 
RGAsa' and first level concatenations of RGA4, and RGA^^ . 
The generalized sequence structures are given by 

RGArea^ := Ps{RGA8a^)Ps{RGA8a^) (86) 
RGAiQb' := RGA^[RGA^>] (87) 

where the primes denotes the difference in phase config- 
uration among the pulses as described in Sec. IV B 1 and 
IV B 2. The first sequence, RGAi^a', is a concatenation be- 
tween Psfr^Psfrd and RGAsa', while RGAiq^' is a single- 
level concatenation that involes both RGA4 and RGA^^ . This 
particular configuration is found to be more suitable for com- 
bating pulse-width errors since it allows for first order sup- 
pression of terms proportional to Tp while additional configu- 
rations, such as RGA4' [RGA^^] and RGA^^ [RGA4], do not. 
Note that RGA^lRGA^] does not appear either even though 
it achieves first order error suppression in Tp due to the more 
significant error suppression provided by both RGAiQa' and 
RGAiQb'- 

As shown in Fig. 3(c), the regions of optimal perfor- 
mance are essentially divided equally among RGAiQa' and 
RGAiQb' . RGAiQa' tends to be optimal for J > (3 \/Tp, while 



In order to gain more insight, we turn to the effective error 
Hamiltonian for each sequence and determine that 

+ ay[Bo,By - -B,]+a'[Bo,B, + -By]) 

(89) 

and 

frRGA,^,, _ SiTpTd r 2 4 

^err ^ CT [Bq, By Bz\ 

Tc TT TT 

Tc TT 

+ ^^a^[Bo,B^--B,]. (90) 

Tc TT 

in the strong-pulse regime. The primary distinction between 
the two constructions is the additional term in the -channel 
for RGAiQb', which explains the preference in RGAiQa' 
when J > j3 and the relatively equal sharing of the parameter 
space when J < /3. In the remaining region where Tp > Td 
partitioning of the space can also be explained in terms of the 
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effective error Hamiltonian, where 



for RGAiQa' given in Eq. (89) to that of RGAs2c given by 



(91) 



jjRGA^Q 
J^err 



■-a^{Bl + Bl + Bl + {By,B,}) (92) 



clearly agrees with the more pronounced dominance of 
RGAiQa' when J > (3 indicated in the numerical results. An 
additional analysis confirms a higher degree of error suppres- 
sion for RGAiQ})' when J < {3 and Tp > Td- 

The suppression of 0{J'^TdTp) terms is the most substan- 
tial benefit of the i^T = 16 sequences over i^T = 8. RGAiQa' 
achieves this additional decoupling by concatenating RGA^a' 
with an elementary one-dimensional sequence to suppress 
the single-channel decoherence of 0{J'^TdTp) remaining in 
Eq. (81). RGAiQ})f requires a more complicated concatena- 
tion process to attain a similar effective error Hamiltonian 
since it is also necessary to remove the 0{Jrp) terms not sup- 
pressed by RGA^^ . 



4. 32-Pulse Sequences 

Optimal performance at K = 32 is characterized by two 
particular sequence constructions: 2 x RGAiQa' and 



RGAs2c = RGAsclRGA^]. 



(93) 



As shown in Fig. 3(d), RGAiQa' is the preferred choice for 
J < (3 ^Tp^Td, as well as J > /3 when Tp <C Td, while 
RGAs2c dominates the J > /3 with Tp > Td regime. Using 
the scaling of RGAiQa' given in Eq. (88) and 



D { 



0{JpTdTp) 

0{JpTdTp) 
0{JpT^p) 



J <^P,Tp<^Td 
J <^P,Tp:^Td 
J:^ P,Tp<^Td 

J'>(3,rp'>rd 



(94) 



determined for RGAs2c^ we can conclude that RGAiQa' 
is not the desired sequence construction for interaction- 
dominated dynamics when Tp > Td due to the presence 
of 0{J'^Tp) terms in the effective error Hamiltonian [see 
Eq. (89)]. The concatenated construction designated by 
Eq. (93) alleviates this problem and achieves complete sup- 
pression of (9(J^r^) terms. In fact, RGA2,2c turns out to 
be quite special in that additional sequence structures such as 
RGA^a'[RGA^], RGA^[RGA^a'], and RGA^[RGA^c] are 
unable to attain such a high level of decoupling. Most no- 
table are those which utilize RGA4^ at the outer layer of the 
concatenation and fall victim to the less desirable decoupling 
properties of the i^T = 4 sequences. 

In the remaining regions of the parameter space both se- 
quences scale equivalently, yet RGAiQa' is found to be the 
optimal sequence. Comparing the effective error Hamiltonian 



TT 

^G'\B^,B,--B,^-By\\, (95) 



the lowest order difference between the evolutions is es- 
sentially an additional term along the error channel for 
RGAs2c- Consequently, RGAiQa' achieves a higher perfor- 
mance than RGAs2c' when j3 > J and in the strong-pulse 
regime when (3 < J. Note that RGAiQa' is also optimal 
for a small region of the parameter space when J ^ P and 
^ 'Td- This sudden appearance of RGAiQa' occurs as 
jTd 1, where the 0{J'^t'^) scaling is more favorable than 
the 0{J^r^) scaling for RGA32c- 

The most interesting aspect of the K = 32 analysis is that 
RGAs2b achieves error reduction, yet the performance re- 
mains 0{TpTd) in the strong pulse regime indicating a possible 
saturation in performance and significant error suppression 
capabilities. This artifact is due to the presence of pulse im- 
perfections in the effective error Hamiltonian that may not be 
suppressed further simply by adjusting the sequence config54- 
uration. For this reason, it may be necessary to consider pulse 
shaping techniques to effectively average out additional finite- 
width contributions [72-74]. 



5. 64 -Pulse Sequences 

In the case of K = 64, robust DD is mainly characterized 
by the single layer concatenation of RGAsc- 



RGAq4c — RGAsc[RGAsc] 



(96) 



and 4 x RGAiQa'. The distinction between the sequences 
is evident from Fig. 3(e), where RGAq^c is preferred in the 
interaction-dominant regime primarily when Tp Td and 4 x 
RGAiQa' is optimally robust when the environment dynamics 
dominate. The partitioning of the space can be understood 
by comparing the performance scaling of RGAiQa' given in 
Eq. (88) and 



D < 



0{JpTdTp) 
OiJpTdTp) 

0{Jf3r^) 



J ^P.Tp^Td 

J<^(3,rp:^rd 
J:^p,Tp<^Td 

P.Tp^Td 



(97) 



determined for RGAq^c, most notably when J ^ {3 and 
'Tp^ where terms of (9 (J^r^) are effectively suppressed 
for RGAq/ic For J ^ /3, it is not possible to draw any 
conclusions regarding sequence preference from the scaling 
equations since both exhibit identical performance scalings. 
Therefore, we turn to the effective Hamiltonian to gain some 
additional insight. 



For the case of RGAiQa' the effective error Hamiltonian is 
presented in Sec. IV B 3 [see Eq. (89)], while for RGAq^c we 
find 

Tc \ TT 

+ay[Bo,By - -B,]+a'[Bo,B, + -By]] . 

(98) 

Interestingly, both sequences have effective error Hamilto- 
nians that contain similar low-order terms in the /3 <C J 
regime. Substantial differences appear in higher order terms 
of OiP^Jrjrp) and OiP^Jrar^), where RGAiQa' generates 
far less effective decoherence than RGAq^c- This result is 
consistent with the values of sequence performance which 
tend to deviate more strongly as (3 increases beyond J. In 
the interaction-dominant regime, the advantage of RGAq^c 
over additional sequences, e.g., 2 cycles of RGAs2c^ is the 
increased suppression of O^JjSr'^) and 0{Jf3rprd) evident in 
the effective error Hamiltonian when Tp > Td (not shown) 
and Tp < Td [compare Eq. (95) and Eq. (98)], respectively. A 
similar performance and effective Hamiltonian analysis also 
conveys the inadequacies of RGA^a' [RGA^a' ] for suppress- 
ing 0{JpTdTp) and 0{J'^tI) terms. 

6. 256-Pulse Sequences 

The space of optimal sequences for K = 256 is character- 
ized by previously identified sequences: 16 x RGAiQa' and 
4 X RGAq4c with performance scalings given by Eqs (88) and 
(97) and effective error Hamiltonians presented in Eqs. (89) 
and (98), respectively. In Fig. 3(f), we present a compre- 
hensive numerical summary for the regions of optimal perfor- 
mance for i^T = 256 as a function of J and Tp for fixed /3 and 
Td. Note the similarities between the K = 256 and K = 64 
numerical results which are nearly identical, as RGAiQa' is 
optimal for J < (3 and RGAq^c is favorable for J > /3, due 
to the preference in optimal sequence structure. More intricate 
sequence constructions such as 

RGA2^6c := RGA^RGAe^c] (99) 

suffer from an accumulation of 0{Jrp) terms generated by 
the concatenation with RGA4 on the outer level. A per- 
formance scaling similar to that of RGAq^c can be re- 
established by reversing the concatenation order according 
to RGAq4c[RGA4], yet an accumulation of 0{J/3rdrp) and 
0{J'^rp) terms renders this sequence structure less favorable 
than 4 x RGAq^c 

A summary of the performance scaling equations for all 
optimal sequences discussed in Sec. IV B is presented in 
Table III. Note that first order error suppression, in r^, is 
achieved for a majority of i^opt- However, we are only able 
to demonstrate the reduction of second order decoherence op- 
erators, such as the suppression of 0{J'^TdTp) terms for cer- 
tain cases, and not complete suppression of 0{TpTd) or 0{rp) 
terms. This result is consistent with DD no-go theorems 
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Sequence 


J</3 


J:>f5 


Tp < Td 


Tp > Td 


Tp < Td 


Tp » Td 


RGA4 


0{JpTl) 


0{jTp) 


0{PtI) 


0{jTp) 


RGA^> 


0{JI3t!) 


O(jTp) 


0{.PtI) 


0{jTp) 


RGAsa' 


OiJpTdTp) 


0{JPT^) 


0{j''TdTp) 


0{J^tI) 


RGAsb 


0{jPTdTp) 


0{jTp) 


0{Jtp) 


0{Jtp) 


RGAsc 


o{jpt!) 




0{J\1) 




RGAiQaf 


0{JpTdTp) 


OiJpTdTp) 


OiJpTdTp) 


0{J\'p) 


RGAiQjjf 


OiJpTdTp) 


0{JpTaTp) 


O(JpTdTp) 


0{.Pt'p) 


RGA32C 


OiJpTdTj,) 


0{JpT^) 


OiJpTdTp) 


o{jptI) 


RGAq4c 


OiJpTdTj,) 


0{JpT^) 


OiJpTdTp) 


O(JPT^) 


RGA256C 


OiJpTdTp) 


0{jTp) 


OiJTp) 


0{jTp) 



TABLE III. Summary of distance measure (D) scalings for each opti- 
mal RGAk sequence located by our search algorithm, for DD evolu- 
tion subjected to finite- width rectangular pulses of duration Tp pulses 
and pulse-interval Td. 



which prove that it is not possible to suppress decoherence 
operators that are manifested by the second order perturbation 
expansion for the pulse error evolution operator, i.e. 0{TpTd) 
and 0{rp) terms, when rectangular pulse profiles are utilized 
[75, 76]. Our analysis is consistent with these theorems and 
further conveys the need to utilize pulse shaping techniques 
in conjunction with optimal sequence construction to achieve 
high order error suppression in the presence of finite-width 
pulses. Indeed, when liberated from the constraint of rectan- 
gular pulse profiles, pulse sequences using DCG and CDCG 
[46-48] may be employed when pulse-width errors are the 
dominant concern. 



C. Flip-angle errors 

An additional form of pulse error we consider is that of a 
flip-angle error. In Fig. 5, we analyze the performance of DD 
sequences optimized for flip-angle errors with respect to the 
ratio J//3 G [10-^ 10^] for /3 = IkHz and e G [0,0.2]. The 
results are averaged over 10 realizations of Eq. (15). 

The control pulse set Q is equivalent to Eq. (70), with the 
pulse profile defined in Eq. (20). The resulting unitary pulse 
operators are given by 

X(r,Z) = e-^-/2(i+^)"^''"' (100) 

and {X, F, 2} = {X^, F^, Z^}. The analysis is symmetric 
with respect to over or under-rotations, therefore, our focus on 
over-rotations does not result in a loss of generality. As in the 
case of finite- width pulses, the initial population is considered 
to be only 16 of the 49 initial configurations chosen at random, 
each with an equal probability. 

1. 4-Pulse Sequences 

In the case of flip-angle errors, the optimal sequence con- 
struction for i^T = 4 is primarily RGA2 = P2fTdP2fTd- The 
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performance can be shown to scale according to 



D < 



0{Jrd) 
0{eJrd) 
0{Jrd) 
[ 0{eJrd) 



J <^ e:^ jTd 
J e<^ jTd 

J f3, e:^ jTd 



(101) 



which is in direct agreement with the effective error Hamilto- 
nian: 



RGA2 



{cjyB.-cj'By). 



(102) 



In Fig. 5(a), the performance for RGA2 is shown as a function 
of e and the ratio J//3. RGA2 is the optimal sequence for 
all values of e and J within the designated parameter range, 
except for a particular region at J//3 = 10^, where RGA4 
is preferred. Analyzing the effective error Hamiltonian for 
RGA4 in the presence of flip-angle errors, we find 



RGA4 



STd 



[cj^B,_y^cjy{By-2B,)l (103) 



which contains an additional error term of 0{e^ /rd). An ex- 
amination of the performance yields 



D 



J <^ (3, e<^ jTd 
J <^ e:^ jTd 
J f3, e <^ jTd 

J e:^ jTd 



(104) 



where the term quadratic in e shown in Eq. (103) indeed dic- 
tates the scaling of the performance in the e ^ Jrd regime. 
Since the magnitude of the flip-angle error is determined by 
the accuracy of the experimental equipment and cannot be di- 
rectly manipulated, terms which are not proportional to the 
delay time are significantly more destructive to DD evolution. 
An important question to be addressed is whether high order 
error suppression is attainable for errors which are solely pro- 
portional to the flip-angle error, or if the amplitude of such 
error terms can be reduced dramatically, simply by sequence 
construction. 



2. 8-Pulse Sequences 



We identify 



RGAg 



IP1P2P1IP1P2P1 



(105) 



as the only optimal sequence for K = 8 in the pres- 
ence of flip-angle errors. The structure is similar to GA^a 
and RGAsa', differing only by pulse phases, and identical 
to a time- symmetrized version of RGA4 sequence, namely 
RGA4RGA4. Time-symmetrization has long been know to 



be beneficial for DD sequence construction since all odd- 
order terms in the effective error Hamiltonian are averaged 
out [25, 77], even in the case of pulse errors [13]. The effect 
of symmetrization is apparent from the significant increase in 
performance amplitude, implying a reduction in error terms 
O(e^); see Fig. 5(b). In fact, a direct analysis of the scaling of 
the performance reveals complete cancellation of O(e^) and 



D^O{eJrd) yj^P.e.Td. 



(106) 



The scaling is supported by the effective error Hamiltonian for 

RGAga- 



ffRGAsa ^ -^cr'B + — 



-cjyB^, 



cjy{2B,-By), 



(107) 



where the 0{eJrd) term is accompanied by two additional 
0{e^jTd) terms. In order to provide successive error sup- 
pression of terms 0{e^), n > 1, it is necessary to continue 
producing dominant decoherence operators that are linear, or 
even quadratic, in Jrd to effectively control the amplitude of 
the errors associated with e. 



3. 16- Pulse Sequences 

Optimal performance fovK = 16 is obtained for a robust 
version of GAiQa denoted by 



RGAiea = P3 {RGAsa) P3 (RGAsa) • 



(108) 



The structure utilizes a robust version of the generalized 
CPMG sequence, RGA2, to encompass the RGAsa sequence 
and achieve a performance scaling 



D^O{e^jTd) yj,(3,e,Td. 



(109) 



The additional level of concatenation enhances robustness and 
results in a quadratic improvement in error suppression over 
RGAsa- An analysis of the performance for RGAiQa as a 
function of e and the ratio J/ f3 is provided in Fig. 5(c). 

In constructing the RGAiQa sequence there is a freedom 
in the choice of P3. Similar to our discussion in Sec. IV B 2 
regarding the possible sequence variations of RGA^i,, let us 
consider {Pi, P2} = {-^, ^} and analyze the effective error 
Hamiltonian for each possible non-trivial P3 variation. Taking 
P3 = X, F, Z, we obtain the following effective error Hamil- 
tonians: 



2 2 

RGAiQa ^ ^ ^ \rr^B -rryp ] 



(110a) 



2 2 

H^Sl]^^ - ^[(j''Bx_y - ay{2B, - By)], (110b) 



^err,P3=^ ^ 4 
frRGAiea _ ^^z-n 



(llOc) 



The first two cases remain quadratic in e and consistent with 
the scaling of Eq. (109). The last case exhibits a dimin- 
ished error suppression and scales as 0{eJrd), the same as 
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(d) K = 32 (q)K = 64 (f)K = 256 





RGA4 


^RGAsA ^|mxRGA64a 


^^RGA2 




^|nxRGAi6« RGA256fl 



FIG. 5. (color online) Performance of RGAr sequences for = 4, 8, 16, 32, 64, 256, as shown in (a)-(f), respectively, as a function of J//3 
and e. The minimum pulse interval is fixed at Td = 0.1ns, J/13 ^ [10~^, 10^], and /3 = IkHz, while e is varied from one to twenty percent 
rotation error. Results are averaged over 10 realizations of 5^. Sequence performance mostly increases from K = 4io K = 64, indicating a 
reduction in the error terms proportional to e in the effective error Hamiltonian. Successive error suppression is achieve for X = 4, 8, 16, 64, 
where the maximum error suppression yields D ^ 0{e^Jrd). for RGAoAa- Multiple cycles of RGAi6a and RGAoAa appear as optimal 
sequences for various Kopt- The number of cycles for each sequence is given as follows: (d) n = 2, (e) m = 1, and (f) n = 16, m = 4. 



RGAsa- Of course, this can be understood from Eq. (107) 
where the term linear in e resides along the channel. Ap- 
plying a Z pulse commutes with this term and, therefore, 
does not provide any additional error suppression. As previ- 
ously discussed, choosing the correct sequence configuration 
is critical in constructing a robust sequence. This is especially 
true in cases like RGAiQa where the outer-level concatenat- 
ing sequence does not address all error channels, i.e., the 
pulse operators do not anti-commute with all possible system- 
environment interaction terms. 



4. 32-Pulse Sequences 

The sequence structures obtained so far for robustness 
against flip-angle errors suggests that a robust version of 
GAs2a would appear as the optimal sequence for K = 32. 
In actuality, robust versions of GAs2a do not exhibit ample 
increases in performance over RGAiQa- The sequence con- 
struction 



and generate an effective error Hamiltonian 



RGAs2a = RGA^RGAsa] 
can be shown to yield a performance scaling 



D 



0{ejpTl) 



(111) 



(112) 



RGA3 



64Td 



a' ^2^ieTda'[Bo,B,_y], (113) 



both of which display a dominant decoherence-generating 
term of O(e^). The preferred sequence structure for K = 32 
is RGAiQa, where two cycles are applied to generate a 32- 
pulse sequence. Saturation in performance is perhaps an indi- 
cation that maximum robustness against flip-angle errors has 
been obtained and attempting to apply more sophisticated se- 
quence structures simply results in error accumulation, rather 
than suppression. Error accumulation is validated here with 
the reemergence of the 0{e^) term in Eq. (113), that had 
been previously suppressed by RGA^a- Furthermore, the 
reduced performance can be attributed to the outer layer se- 
quence, RGA4, which generates 0{e'^) terms that essentially 
destroy the additional error suppression achieved by time- 
symmetrization. One way of overcoming additional decoher- 
ence brought about by flip-angle errors is to use composite 
pulses which essentially mimic the application of a single tt- 
pulse by applying a number of variable angled pulses in suc- 
cession [33, 54]. A summary of = 32 performance is given 
in Fig. 5(d), where n x RGAiQa with n = 2 denotes two suc- 
cessive cycles of RGAiQa- 
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5. 64 -Pulse Sequences 

Robust DD is achieved at K = 64 by 

RGAe^a = RGAsalRGAsa], (114) 

a robust version of GAQ4a- In contrast to K = 32, applying 
multiple (in this case four) cycles of RGAiQa does not attain 
optimal performance for K = 64. Examining the scaling of 
the performance, we find that RGAQ^a attains an additional 
order of error suppression over RGAiQa that is immediately 
evident from 

Dr^Oie^jTd) yJ,pe,Td (115) 

and 

ffRGA,.. ^ t^a^By - ^a^B^, (116) 

where the 0{e^JTd) term is effectively suppressed. In the 
previous discussion on i^T = 32 sequences, we noted that 
it is possible that the outer RGA4^ sequence diminished per- 
formance due to its lower order error suppression. The fact 
that an additional order of error suppression over K = 16 
is achieved here for a concatenated structure comprising two 
RGAsa sequences, both of which supply the same order of 
error suppression, further validates this claim. Therefore, one 
of the important aspects of sequence construction for concate- 
nated protocols is to utilize sequences which at least obtain 
the same order of error suppression. Figure 5(e) summarizes 
the K = 64 performance over a range of e and J/ j3 values. 

6. 256-Pulse Sequences 

In the case of i^T = 256, the optimal sequence configura- 
tions appear as multi-cycle versions of previously identified 
sequences. Optimal performance in the system-environment 
interaction-dominant (J > (3) regime is achieved by 4 x 
RGAQ4a, while for J < (3 the parameter space is shared 
by 16 X RGAiQa and 4 x RGAQ^a- As shown in Fig. 5(f), 
sequence performance eventually degrades further and it be- 
comes more favorable to not apply DD at all for large e. 

Similar to K = 32, robust versions of i^T = 256 sequences 
are not found to be optimal. Previously located sequences 
suggest that 

RGA2^6a = RGA^lRGA^^al (117) 

may serve as a robust configuration for flip-angle errors; how- 
ever, this turns out to be false. We attribute the superiority of 
multi-cycle sequences to the outer-most level of concatenation 
of RGA256a since it does not achieve the same decoupling or- 
der as the inner levels. Compared to RGA^^a, RGA256a re- 
ceives a substantial reduction in performance by the addition 
of the outer RGA4 sequence and re-emerges with a distance 
measure scaling 

D^O{e^) yj,p,e,rd, (118) 



Sequence 




J>/3 




e < Jrd 


e » jTd 


e < jTd 


e > jTd 


RGA2 


0{JTd) 


0{tjTd) 


0{jTd) 


O(tjTd) 


RGA4 










RGAsa 


0{eJTd) 


OitjTd) 




OieJTd) 


RGAiQa 


0{e'jTd) 






Oi^jTd) 


RGA32a 










RGAQ4a 




O(e'jTd) 


O(e'jTd) 


O(e'jTd) 


RGA256a 


0{e') 


0[^) 


0(^) 


0[^) 



TABLE IV. Summary of distance measure D scalings for each opti- 
mal RGAk sequence located for DD pulses subjected to flip-angle 
errors with rotation error e with fixed pulse-interval Td. 

which is consistent with that of RGA4 given in Eq. (104) for 
certain parameter regimes. The effective error Hamiltonian 

ffRGA,,ea ^ + ^^(a- - ay) 

bUTd 1024r/ ^ 

-^47rHe^TdCT'[Bo,3B, - 2By] (119) 

further substantiates the numerically-determined performance 
scaling. In Table IV, we display the performance scaling for 
K = 256 along with the scalings for the remaining values of 
Kopt discussed above. 

In summary, the results presented for K=Kopt in the pres- 
ence of flip-angle errors demonstrate that successive error 
suppression is achievable by concatenation only if the outer 
sequence maintains the same decoupling order as the inner 
sequence(s). Supplying a lower order decoupling outer se- 
quence ultimately leads to an effective error Hamiltonian that 
possesses dominant error terms that are intrinsic to the low- 
order sequence. Such a situation did not arise for finite- width 
pulse errors due to the absence of pulse shaping. Uninhib- 
ited by no-go theorems for flip-angle errors, we believe that 
extending the search beyond K = 256 will result in addi- 
tional sequence configurations that utilize concatenations of 
RGAsa or even a more robust construction such as RGAiQa- 
This conclusion is supported by the fact that the performance 
scaling obtained for i?GA5i2a = RGAsalRGAgalRGAga]] 
is D 0{e^jTd), which suggests that arbitrary order error 
suppression using i concatenations of RGA^a can be used to 
achieved - 0{e'^^-^Jrd). 

D. Finite-Width and Flip- Angle Errors 

Flip-angle and finite-width pulse errors are prevalent in a 
variety of experimental settings. Therefore, it is necessary for 
DD sequences to be robust against both types of errors simul- 
taneously if reliable computation is to be implemented under 
the protection of DD in realistic setups. Designs based on 
(C)DCG [46-48] do not apply in this case as they do not ad- 
dress flip-angle errors. The KDD sequence is applicable, but 
unlike the present study, it employs non-7r pulses [54]. As 
a final consideration, we investigate the inclusion of the two 
forms of pulse errors and search for optimal sequences at each 
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FIG. 6. (color online) Performance of RGAr sequences for = 4, 8, 16, 32, 64, 256, as shown in (a)-(f), respectively, as a function of e and 
Tp/rd. The minimum pulse-interval is fixed at Td = 0.1ns, while J = IMHz and /3 = IkHz. The flip-angle error is varied from a 1% to a 
10% error and Tp is varied throughout a wide range of values so that Tp <^ Td, and Tp ^ Td, is explored. The majority of the parameter space 
is dominated by the RGAr a sequences, even in the finite- width error-dominant regime. The robustness of RGAr a to pulse imperfections 
ultimately saturates at K = 64. All simulations are averaged over 10 realizations of B^. The optimal sequences that are used in multiple Kopt 
values require the following number of cycles: (a) m = 2, (c) n = 1, (d) n = 2, (e) p = ^ = 1, (f) p = ^ = 4. 



^opt- By performing this search, we are essentially address- 
ing the possibility of constructing fault-tolerant DD in the 
most convenient possible arrangement: fixed pulse-interval 
and rectangular pulse shape. 

The control pulse set Q again contains 7 possible pulses, 
where each pulse operator is defined by 



(120) 



and {X, ?, 2} correspond to 180-degree phase rotations, i.e. 
A ^ —A. The amplitudes of the error and bath Hamiltonians 
are chosen as J = IMHz and (3 = IkHz, respectively. Op- 
timal sequence performance is analyzed with respect to e and 
Tp in the regime where J ^ j3 and jTd <C 1 to characterize 
sequence robustness as a function of errors generated by the 
DD pulses. 



1. 4-Pulse Sequences 

The optimal sequence configurations for K = 4 are RGA2 
and RGA4. Since RGA2 consists of only two pulses, the cy- 
cle is simply repeated twice to generate a four pulse sequence, 
i.e. 2 X RGA2. In Fig. 6(a), 2 x RGA2 is shown to be op- 
timal for a considerable portion of the parameter range con- 
sidered: Tp < Td when e > Jr^. Dominance is exchanged 
at Tp = Td and RGA4 becomes the preferred sequence con- 



figuration. In order to understand the variations in optimal 
sequence selection, we numerically determine the scaling of 
distance measure D and further characterize sequence perfor- 
mance by computing the effective error Hamiltonians for both 
sequences in the regime where jTd <C 1. 

In the case of RGA2, the performance can be shown to 
scale according to 



D 



(121) 



which is dictated by first order terms in either Td or Tp. This 
result is quite different from RGA4, where 



D 



o{.PtI) 

0{jTp) 
0{jTp) 



e <C JTp <C JTd 
JTp <C e <C JTd 
JTp <C JTd <C e 
e <C JTd <C JTp 
JTd <C e <C JTp 
jTd <C JTp <C e 



(122) 



indicates a more complicated scaling structure. Considering 
the case where Jr^ <C Jrd <C e, which is essentially captured 
by e = 10~^ in the strong pulse regime [Tp <C Td\ in Fig. 6(a), 
RGA2 is the preferred sequence configuration due to its scal- 
ing in Tp rather than e^. Performing a similar analysis for 
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jTd <C JTp with e <C JTp is more complicated since both se- 
quences scale equivalently: D ~ 0{Jrp). Consequently, we 
employ average Hamiltonian theory to eludicate the details. 

The evolution generated by RGA2 results in the effective 
error Hamiltonian 



RGA2 



4r, 



2er„ 



TTCTd 



{(jyB, - a' By) , 



(123) 



while for RGAa 



RGA4 



4r^ 



2rc 
4er„ 



TTTr 



(1 



ct'B, 
4 



2er, 



^{cT--cTy)B, 



2r„ 



— Bo, 
2 ^ 



(124) 



The application of two cycles of RGA2 to match the four- 
pulse sequence of RGA4 leads to a factor of 2 increase for 
the RGA2 effective error Hamiltonian over that of RGA4, 
however, the implication is not necessarily that the perfor- 
mance will degrade by a factor of two. An additional pos- 
sibility for A4 -dominance in the e <C Jrp regime is the 
presence of the difference operator B^-y in H^^^^ rather 
than the single bath operator proportional to Tp expressed in 
Eq. (123). RGA4 performance appeared to benefit from the 
difference operator in the case of finite-width pulses so it is 
possible that the same situation exists here. The effective error 
Hamiltonians can be utilized to futher provide consistency be- 
tween the remaining numerically-calculated distance scalings 
and analytical expression, e.g., H^^^'^ conveys a suppression 
of 0{e^) terms, while H^^^ does not. 



2. 8-Pulse Sequences 

Two of the three robust sequence constructions identified in 
previous sections, RGA^a and RGA^c, characterize the pa- 
rameter space for combined pulse errors. The most favorable 
sequence is found to be RGA^a, as shown in Fig. 6(b), where 
it is optimal for all values of Tp and e except Tp when 
e <C 1. The remaining region is dominated by RGA^c as ex- 
pected by the finite- width error analysis presented above. Fo- 
cusing on the interplay between errors generated by the DD 
pulses, the performance for RGA^a is found to scale as 



D 



while for RGAi 



0{eJTd) 
0{eJrd) 
0{Jrp) 



8c 



D 



(9(JVJ) 

0{J^TdTp) 

0{eJTp) 
0{eJrp) 



JTp <C e <C JTd 
JTp <C JTd <C e 
otherwise 



e <C JTp <C JTd 
JTp <C e <C JTd 
JTp <C JTd <C e 
e <C JTd <C JTp 
JTd <C e <C JTp 
jTd <C JTp <C e 



(125) 



(126) 



Using the scaling equations, we can attribute the dominance of 
RGAsa when Jrp <C Jrd <C e to the preference of 0{eJrd) 
scaling, as opposed to cubic scaling in e found for RGA^c, 
in the range of e considered in Fig. 6(b). Taking e < 10~^ 
[or in general < Jrd], the cubic scaling becomes more fa- 
vorable, as one may expect, until e ~ Jrd where the quadratic 
scaling in Jrd begins to contribute significantly to the RGA^c 
scaling. Equations (125) and (126) offer additional justifica- 
tion for the selection of RGA^c as the optimal sequence when 
Jrd <C e <C Jrp since RGA^c has the ability to achieve 
0{Jrp) error suppression. The distance measure scalings for 
RGAsa and RGAsc can be further utilized to substantiate the 
preferred optimal sequence construction in the remaining re- 
gions displayed in Fig. 6(b). 

The effective error Hamiltonians for the optimal i^T = 8 se- 
quences, in the presence of finite-width and flip-angle errors, 
are essentially characterized by 

fjRGM. « ^ [a^B, + a^{B, - 2B^)] - ^a^B^.y 

rc 

(127) 



for RGAsa and 



RGAs 



2r^ r, 



-{Bx + By) 



(128) 



for RGAgc- The dynamics generated by RGAga focus pri- 
marily on the errors generated solely by the flip-angle error 
and do not attain suppression of decoherence operators pro- 
portional to both e and r^. In contrast, RGAgc focuses on both 
errors simultaneously, achieving first order suppression in 
and second order decoupling for errors solely proportional to 
e. RGAsc exhibits many attractive attributes for joint robust- 
ness against flip-angle and finite-width pulse errors, however, 
these features are constrained to sufficiently small e, namely 
< Jrd. For the parameters considered here, this corre- 
sponds to a flip-angle error of e < 0.1% which may be quite 
ambitious for current experimental systems that can exhibit a 
rotation error well over 0.1% [17, 30, 66]. 



3. 16-Pulse Sequences 

Robust decoupling is achieved for i^T = 16 using RGAiQa, 
where performance can be shown to scale as 



D^O{Jrp) We,rp,rd. 



(129) 



In the flip-angle error dominant regime, RGAiQa perfor- 
mance does not appear to be linearly or quadratically de- 
pendent on e, implying the suppression of 0{e^) terms, for 
n = 1, 2. RGAiQa does not acquire robust decoupling against 
finite- width pulses and maintains the first order contribution in 
Tp. As shown in Fig. 6(c), 



= RGA^^IRGA^^] 



(130) 



also appears as an optimal sequence in the regime where 
^ and e < Jrp. This sequence construction differs 
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slightly from RGAiQ^jf in that both levels of the concatena- 
tion are composed of RGA^r^ hence, the double prime no- 
tation. The additional configurations, namely, RGAiqI) and 
RGAiQi)', are not optimal in this case due to their inability 
to suppress decoherence operators linear in e. In contrast, the 
performance for RGAiqij^^ scales according to 



for i^^ = 32 are summarized for a wide range of Tp and e in 
Fig. 6(d). 

From a quantitative standpoint, 2 x RGAiQa superiority can 
be deduced from the scaling of the distance measure D. The 
performance scaling for RGAiQa is given in Eq. (129), while 
for RGAs2a we determine that 



D 




e > jTp and e > Jra 
otherwise 



(131) 



where the superiority of RGAiQa over RGAiQi^f is understood 
by the quadratic scaling in e in the flip-angle dominant regime. 
In the regime where finite-width errors are prevalent, opti- 
mal robustness is obtained by RGAiQiy>>, even though both 
sequences exhibit the same scaling. We compute the effective 
error Hamiltonians for each sequence to determine the origin 
of RGAiQi)ff robustness. 

Let us first focus on the RGAiQa and consider the three 
possible sequence variations: 



Tj-RGAiGa o 

^err,P3=X " 


. 4rp 

TTTc 


{^a'^B, - 


-a^By) 


(132a) 


frRGAiGa . 


TTTc 


{ct'B, - 




(132b) 


frRGAiGa ^ 


TTTc 


[a- By - 


cjyB,^2cj\B, 


- 25.)] 
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for {Pi, P2} = {X^ Y}. Suppression of terms quadratic in e 
is accomplished by all variations, consistent with Eq. (129). 
The distinction between the sequences lies in their ability to 
suppress 0{Jrp) terms, with = Y attaining the most sup- 
pression. In comparison, the suppression of decoherence op- 
erators generated by the finite duration of the pulses is more 
considerable for 
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since only a single term of 0{Jrp) exists. Therefore, 
RGAiQi)'' is the preferred sequence due to its higher degree 
of error reduction, rather than suppression. 



4. 32-Pulse Sequences 

Degradation in robustness is first observed for K = 32, 
where 2 cycles of RGAiQa are found to be a more favorable 
sequence configuration than any other previously identified 
32-pulse sequence for most of the parameter range consid- 
ered. This result can be explained in terms of the dominant 
error type, whether it is flip-angle or finite pulse duration. In 
the e jTp regime, RGAiQa is the optimal sequence due 
to its robustness to flip-angle errors. As the finite duration of 
the pulse becomes more detrimental, e <C Jr^, RGAs2a Per- 
formance surpasses that of RGAiQa- This is not surprising 
in light of the results shown in Sec. IV B 4, where we deter- 
mined that RGAs2a provides first order suppression of error 
terms proportional to r^, while RGAiQa does not. The results 
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In the flip-angle error-dominant regime, the performance for 
RGAiQa remains proportional to Tp while RGAs2a perfor- 
mance transitions from an 0{Jrp) to an (9(e^) scaling. Con- 
sequently, the transition leads to RGAiQa becoming the more 
desirable sequence construction. Optimal sequence selection 
becomes harder to decipher in the Jr^ e regime since both 
sequence scale equivalently. A comparison of the effective 
error Hamiltonian for RGAs2a^ 
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and that of RGAiQa suggests that RGAs2a is more favorable 
when finite-width errors are dominant due to the difference 
operator Bx-y that is only distributed along a single error 
channel. This result is similar to the K = 4 case when only 
finite- width pulses are considered. RGA4 emerges as the op- 
timal sequence over RGA4' as a result of single-channel de- 
coherence generated by Bx-y. 

In addition to RGAs2a, RGAs2c also arises as an optimal 
sequence when Tp^ r^- RGAs2c^ region of optimal perfor- 
mance is highly dependent upon e and mainly characterized 
by e <C 1. From the performance scaling 
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and effective error Hamiltonian 
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it is evident that RGAs2c attains first-order decoupling in Tp 
and exhibits a performance scaling linear in e when pulse- 
width errors are dominant. Hence, the flip-angle error must 
be relatively small (e <C 1) in order for the 0{eJrp) scaling 
of RGAs2c to be advantageous. 



5. 64 -Pulse Sequences 

Robustness to combined finite-width and flip-angle errors 
is regained at i^T = 64, where RGAQ4a and RGAq4c are des- 
ignated as the optimal sequence configurations. Similar to 
K = 32, we find that their regions of optimal performance are 
determined by the form of the dominant error. In Sec. IV C, 
RGAQ4a was shown to be highly robust to flip-angle errors. 
Therefore, the e > Jrp region utilizes RGAQ^a as the optimal 
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sequence structure. The opposing region, where the finite du- 
ration of the pulse is the primary form of pulse error, utilizes 
RGAq4c due to its robustness against finite- width pulses. 

Examining the performance as a function of Tp and e, we 
find that RGAQ^a exhibits the scaling: 



and the performance for RGAq^c is dictated by 
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In the e > Jrp regime, RGAQ/^a performance essentially 
scales unbiased by the finite- width contributions, as expected 
when flip-angle errors are dominant. RGAq/^c becomes the 
more desirable sequence configuration when e < Jrp due to 
its first order suppression in Tp. The performance scaling ob- 
tained for both sequences are consistent with the results shown 
for K = 64 in Fig. 6(e). 

In addition. Fig. 6(e) can also be explained in terms of the 
effective error Hamiltonians for RGAQ^a and RGAq^c given 
by 
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respectively. As expected from the distance measure scal- 
ings, H^^^'^'' is dominated by decoherence operators pro- 
portional to Tp and H^^^^^'' is inhibited by a 0{e^) deco- 
herence operator. One of the most interesting characteristics 
of RGA^/^a^ and all "a"-type sequences, is the persistent co- 
efficient for the 0{JTp) term, despite the increase in cycle, 
an indication of error suppression for finite- width pulse er- 
rors. Another interesting artifact is the robustness of RGAq4c 
to 0{eJTp) errors. RGAq^c does not provide complete error 
suppression for these terms, yet there are significantly fewer 
terms present for RGAq4c than RGAQ4a (not shown here). 



6. 256-Pulse Sequences 

Optimal performance is obtained primarily by four cycles 
of RGAQ4a for K = 256. RGA25Qa emerges as an optimal 
sequence construction when Jr^ > e, however, is surpassed 
by the finite-width-optimized sequence 4 x RGAq^c when 
^ and RGAQ4a for the remainder of the parameter 
space shown in Fig. 6(f). Numerically analyzing the distance 
measure D sls a. function of e, Tp, and Td, the performance for 
RGA256a can be shown to be characterized by the distance 
scaling 
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Comparing the performance scaling of RGA236a to RGA^^a 
in the e ^ jTp regime, we find that RGA^^a is the 
more favorable sequence configuration due to its suppres- 
sion of 0{e^) terms present in the effective error Hamilto- 
nian. RGA2^Qa does not supply an equivalent suppression 
and its performance remains dictated by the quadratic term in 
this particular parameter regime. In the remaining regions of 
Fig. 6(e) the reason for RGAQ^a selection over RG A2^Qa is 
indeterminable from the distance measure scalings since both 
are dictated by 0{JTp). We turn to the effective error Hamil- 
tonian to provide the additional details. 

The effective error Hamiltonian for RGAQ^a is given by 
Eq. (140), while for RGA2S6a we obtain 
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Notice that both effective error Hamiltonians contain decoher- 
ence operators that are 0{JTp), however, RGA256a contains 
only a single term. The additional terms in the effective error 
Hamiltonian for RGA^^a clearly result in RGA256a being the 
perf erred sequence in the e <C Jr^ regime. As the contribu- 
tion of finite-width pulse errors continues to grow, the first- 
order suppression in Tp attained by 4 x RGAq^c is more de- 
sirable; hence, the emergence of this optimal sequence when 
Tp > Td and e <C Jr^. 

In Table V, we present a summary of the performance scal- 
ing for each optimal sequence when subjected to finite- width 
pulses and flip-angle errors. The most interesting aspect is 
the eventual saturation in performance to terms linear in Tp 
or quadratic in e. This is an indication that attaining ro- 
bustness for a wide range of r^, Td, and e solely by manip- 
ulating the sequence configuration is insufficient. However, 
this does not invalidate the sequences we have obtained here 
since if a particular parameter regime is achievable, namely, 
the strong-pulse regime where flip-angle errors are dominant 
(JTp <C JTd <C e), sequences such as RGAsa, RGAiQa 
and RGAQ4a still exhibit a high level of robustness. This is 
not only evident from their ability to suppress errors solely 
proportional to e and Td, but also from the fact that error ac- 
cumulation for 0{JTp) terms is not observed as the number 
of pulses is increased; compare the effective error Hamilto- 
nians of Eqs. (127), (132a)-(132c), and (140). Therefore, if 
the pulse duration can be made small relative to the pulse- 
interval then the finite- width errors are less consequential and 
it is still possible to maintain some form of robustness without 
the need for additional techniques. When such a regime is not 
attainable it may be necessary to utilize pulse shaping tech- 
niques to aid flip-angle error robust sequences in the suppres- 
sion of finite-width pulses, or to exploit composite pulses to 
suppress flip-angle errors for sequences highly robust to finite- 
width pulses. Ultimately, a combination of sequence configu- 
ration, pulse shaping, and composite pulses is most likely the 
path forward to constructing fault-tolerant DD sequences for 
a wide range of parameter regimes. 
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TABLE V. Summary of distance measure D scalings for optimal RGAr sequences identified for DD evolution subjected to finite- width 
pulses of duration Tp and flip-angle errors with rotation error e in the regime of system-environment interaction-dominated (J ^ /3) dynamics. 
The sequences with the best performance scaling in each parameter regime (column) are boxed. Note specifically the case of strong pulses 
dominated by flip-angle errors (Jrp <^ Jrd ^ e) where RGAga, RGAi6a, and RGAe^a obtain the more favorable performance scaling. 



V. COMPARISON WITH KNOWN DETERMINISTIC 
SCHEMES 

In this section, GAk and RGAk optimal sequences are 
compared to two known deterministic DD schemes: CDD and 
QDD, for each pulse profile. 

CDD represents a fair comparison to the numerically opti- 
mal sequences since in both cases the pulse-intervals are fixed 
and the error suppression properties are dictated only by the 
sequence structure. Schemes which rely on optimized pulse 
delays to gain decoupling efficiency, such as QDD (see be- 
low), have much better scaling and error suppression proper- 
ties than fixed delay schemes in the ideal pulse limit. How- 
ever, the GAk optimal sequences can be expected to prevail 
in the case of non-ideal pulse profiles, where no robust version 
of QDD currently exists. Optimal control theory can also be 
used to generate robust DD sequences [78]. 

The foundation of single-qubit CDD rests on the uni- 
versal decoupling group (modulo irrelevant phases) S = 
{/, X, F, Z}, i.e, the single-qubit Pauli group, which imple- 
ments first order error suppression by symmetrizing H^^ over 
S. Higher order error suppression can be achieved by contin- 
uing the symmetrization recursively, such that each additional 
level averages out the leading term in the Magnus expansion 
of each sub-level. At the rth level of recursion, 

CDD^ = ?CDD^_iXCDD^_i?CDD^_iXCDD^_i(144) 

where CDDq = fr^ the first level of symmetrization, r = 
1, corresponds to RGA'^ with {Pi, P2} = {X, Y}\ 

CDDi = RGA^> 

= {YU,Y){ZU,Z){XU,X){IUJ). (145) 

Note that 5-based CDD requires 4^ pulses to accomplish rth 
order error suppression [23]. 

A large body of work now exists concerning DD sequences 
with non-uniform pulse-intervals, a technique which enables 
a drastic improvement over the exponential scaling of CDD. 



Uhrig DD (UDD) is one such method, which applies DD 
pulses separated by unequal time intervals, at instants deter- 
mined by a closed form expression originally developed in the 
context of the spin-boson model [34]. Using N control pulses, 
UDD suppresses the first N orders of the time-dependent per- 
turbation theory expansion along the direction which does 
not commute with the pulses, provided the bath spectral den- 
sity contains a sharp high-frequency cutoff [34, 35], or for 
generic bounded bath operators [36]. For an analysis of the 
scaling properties and corresponding distance measure per- 
formance of UDD see Ref. [38]. UDD can be extended to 
combat general single-qubit decoherence by nesting two anti- 
commuting UDD sequences. The resulting quadratic DD 
(QDD) scheme [39], achieves min(Mi,M2)th order decou- 
pling using Ml X M2 pulses in the ideal pulse limit, where 
Mj, j = 1,2, refers to the number of pulses in each nested 
UDD sequence. The decoupling efficiency of QDD has been 
extensively studied and confirmed numerically [40] and ana- 
lytically [41]. Further generalizations of UDD include con- 
catenated UDD (CUDD) [37] and nested UDD (NUDD) [42]. 
CUDD alleviates the restriction of single-channel decoher- 
ence suppression, however, still suffers from the exponen- 
tial scaling of CDD. NUDD essentially follows the nesting 
scheme of QDD, while providing a generalization to multi- 
qubit and multi-level systems. As in the case of UDD and 
QDD, NUDD has been proven to apply to general decoher- 
ence models with bounded bath operators [43, 44]. For an 
analysis of the scaling properties and corresponding distance 
measure performance of QDD and NUDD see Ref. [79]. 

Most relevant to us, because it decouples a single qubit 
from a general system-bath interaction, is the QDD sequence, 
generated by nesting two UDD sequences as 

M2+1 

qdDm,.m. = rr^+' n r2UDD^^(Af=V,) 

= UDD^JUDD^^(Trf)], (146) 
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FIG. 7. (color online) Comparison of performance for CDD (empty 
symbols) and GAr (filled symbols) as a function of the pulse- 
interval Td in the ideal pulse limit. The strength of the error Hamil- 
tonian is chosen as J = IMHz and the strength of the pure- 
environment dynamics /3 — IkHz. Optimal GA sequences achieve 
a higher of order error suppression than CDD as the number of 
pulses increases; note GAe^a and GA256a as compared to CDD3 
and CDD4, respectively. 



FIG. 8. (color online) Comparison of performance for QDD (empty 
symbols) and GAr (filled symbols) as a function of the pulse- 
interval Td in the ideal pulse limit. The strength of the error Hamil- 
tonian is chosen as J = IMHz and the strength of the pure- 
environment dynamics /3 = IkHz. QDD outperforms GAk for 
all sequence lengths, consistent with the expected superiority of 
interval-optimized schemes in the ideal pulse limit. 



where Fi 7^ r2 are the generators of S and 

M+l 
k=l 



(147) 



The free evolution periods for each UDD sequence are dic- 
tated by the normalized pulse-intervals 
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where 
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, j = l,2,...,M + l, (149) 



and (minimum) pulse delay Td. In the following analysis, we 
will focus on Mj = M, j = 1, 2, to account for the effectively 
uniform decoherence model of Eq. (15) and arbitrarily choose 
Tj e {Z, X} as the generators of S. 



deterministic sequence structure for fixed pulse delays. In 
Sec. IV A 6, we elaborate on this fact and explore the possibil- 
ity of designing a deterministic scheme to correctly describe 
the optimal GA sequences. 

Pulse interval optimized sequences have been shown to far 
surpass the decoupling efficiency of any known fixed pulse- 
interval scheme in the ideal pulse limit, requiring only a 
quadratic increase in the number of pulses to suppress an 
additional order of the Magnus expansion. We validate the 
above statement by comparing QDDm to the optimal GA se- 
quences in Fig. 8 for M = 1, 3, 7, 15; an equivalent number 
of pulses to K = 4, 16, 64, 256, respectively. For each set of 
sequence orders {M, M} QDDm attains a decoupling order 
of M, which is beyond the ability of the GA^ for an equiv- 
alent number of pulses. QDDm superiority is an indication 
that optimizing with respect to pulse-interval, in addition to 
pulse configuration, is necessary to obtain higher decoupling 
efficiency. 



B. Finite-width Pulses 



A. Ideal Pulses 

In Figure 7, we compare the performance of CDD/ to GA^ 
with respect to Jrd e [10"^^, 10^] for J = IMHz and 
(3 = IkHz in the ideal pulse limit. Numerically optimal se- 
quences first coincide with CDD^ at K = 16, where both 
achieve second order error suppression. The main advantage 
of GAiQa over CDD2 is a reduction in the error amplitude 
by a factor of approximately 10^. Significant improvement 
in error suppression is observed for K = 64 and K = 256, 
where GA optimal sequences offer an additional order of er- 
ror suppression over corresponding CDD sequences, r = 3, 4. 
The results indicate that CDD does not constitute an optimal 



In Figs. 9 and 10, CDD^ and QDDm, respectively, are 
compared to the RGAk sequences optimized for finite pulse 
width: K = 4, 8c, 166, 326, 64c, 256. The pulse-interval is 
chosen as Td = 0.1ns, while Tp/rd G [10"^, lO'^], J = IMHz, 
and 13 = IkHz. All results are averaged over 10 random in- 
stances of the Hamiltonian. 

Optimal performance for RGAk is observed predomi- 
nately for K = 32c, 64c with K = Wa' exhibiting a more fa- 
vorable performance only in the strong-pulse regime, namely 
when Tp/rd < 10~^ in Fig. 9. As the finite- width pulse er- 
rors contribute more substantially, 0{J'^t'^) terms remaining 
in the effective error Hamiltonian for K = 16c results in a 
rapid decrease in performance leading to RGAs2c/RGAq4c- 
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FIG. 9. (color online) Comparison of performance between RGAr 
and CDDi when subjected to finite pulse duration. Performance is 
characterized as a function of Tp, while Td = 0.1ns. CDD^ perfor- 
mance is essentially the same for all r, scaling sls D ^ 0{JTp). 
RGAk achieves a significant increase in robustness over CDD^ at 
K = 8a , 16a , 32c, 64c, where the performance surpasses the lin- 
ear scaling in Tp. Note the eventual saturation in decoupling order 
characteristic of the rectangular pulse profile displayed by the nearly 
equivalent scaling of K = 32c, 64c. All results are averaged over 10 
realizations of B^. Error bars are included, but are quite small. 



FIG. 10. (color online) Comparison of performance between RGAr 
and QDDm, M = 1, 3, 7, 15, when subjected to finite pulse dura- 
tion. The pulse-interval is fixed at Td = 0.1ns and performance is 
analyzed as a function of Tp. QDDm performance becomes increas- 
ingly worse as the sequence order increases due to an accumulation 
in errors brought about by the finite duration of the pulses. As in the 
case of CDDr, QDDm performance maintains D ^ 0{JTp) for all 
M. See Fig. 9 for details regarding RGAr performance. 



C. Flip-angle errors 



dominance. K = 4 maintains the lowest performance of all 
RGAk sequences due to its inability to suppress the first or- 
der contribution in Tp, D ~ 0{Jrp). CDD^ performance is 
nearly equivalent to i^T = 4 for Tp/ra > 10 scaling as 
D ~ 0{jTp) for all r. The most noticable difference occurs 
at Tp/Td < 10~^, where CDD^ maintains the linear scaling 
in Tp for r = 2, 3, 4 and surpasses K = A^Sc. In Ref. [23], 
an analysis of CDD^ in the presence of finite pulse width is 
discussed as well. There it was shown that CDD^ can re- 
duce pulse-width errors as the concatenation level increases 
if Tp <C Td. Although the total cycle time was fixed, as op- 
posed to the pulse-interval, the results obtained here are quite 
similar in the Tp/Td < 10~^ regime and confirm the inherent 
robustness of CDD^ to finite- width pulse errors. 

As discussed in Refs. [80], UDD-based schemes are quite 
susceptible to finite pulse-width errors and must be imple- 
mented with specially tailored pulses to regain a portion of 
the UDD decoupling efficiency. We confirm this result here 
for the most simplistic pulse shape: the rectangular pulse. In- 
creasing the sequence order does not result in an increase, or 
sustainability, of performance; rather an accumulation of er- 
ror results. As in the case of CDD/, the performance main- 
tains a linear scaling in Tp throughout the specified range for 
the higher of the three sequence orders: M = 3, 7, 15. The 
only variation occurs at M = 1 where the performance be- 
comes dependent upon Td for Tp <C Td. Although there ex- 
ists a regime where both deterministic schemes outperform 
K = A^Sc, higher order RGAk sequences provide a level of 
robustness that cannot be matched by either CDD^ or QDDm • 



In Sec. IV C, robust sequences were identified for control 
pulses subjected to flip-angle errors. Here, we compare the 
numerically optimal RGAk sequences to CDD^ and QDDm 
as a function of e. We consider the case of interaction- 
dominated dynamics and set the strengths of the environ- 
ment dynamics and system-bath interaction to (3 = IkHz 
and J = IMHz, respectively. The pulse delay is chosen as 
Td = O.lns and all data is averaged over 20 realizations of 
the bath operators {B^}. In addition to i^T = 2, 8a, 16a, 64a, 
which exhibit an increase in decoupling order for terms pro- 
portional to e, K = 4, 32a, 256a are included in the compar- 
ison as well to fully characterize RGAk performance with 
respect to both deterministic schemes. 

We first focus on RGAk and CDD^, where RGAk supe- 
riority is clearly evident for all values of K shown in Fig. 1 1 . 
Optimal performance is observed for K = 2, 8a, 16a, 64a, as 
expected, with K = 16a providing the highest level of robust- 
ness to flip-angle errors using the smallest number of pulses. 
Although the lowest level of performance for RGAk occurs 
at i^T = 4, 32a, 256a, a considerable improvement over the 
corresponding CDD^, r = 1,2,3,4, exists. Optimal CDD^ 
performance is achieved at r = 2,4, where the performance 
can be shown to scale sls D ~ 0{e'^). The remaining levels 
of concatenation, r = 1 , 3 do not achieve first order suppres- 
sion, therefore, D ~ 0{e). The comparison clearly indicates 
that the numerically optimized sequences are highly robust to 
flip-angle errors and capable of achieving performances far 
beyond those of CDD^. 

Analyzing QDDm, for M = 1, 3, 7, 15, as a function of e, 
we find that the performance maintains sl D ^ O(e^) scaling 
for all M; see Fig. 12. A similar performance scaling is ob- 
tained for r = 2, 4 in CDD^ as well, where the performance 
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FIG. 11. (color online) Perfomance of RGAr sequences versus 
CDD; as a function of flip-angle error e G [0.01, 0.2] averaged over 
20 realizations of B^. The relevant parameters are chosen as J = 
IMHz, 13 = IKHz, and Td = 0.1ns. Numerically optimal sequences 
are found to be highly robust against flip-angle errors, significantly 
out-performing CDDr for r = 1, 2, 3, 4. 



is equivalent for both concatenation levels. In contrast, the 
performance for QDDm diminishes with increasing M, in- 
dicating an accumulation of error rather than a reduction, or 
sustainability as in the case of CDD^. Although variations 
in performance exist between the deterministic schemes, their 
robustness to flip-angle errors is clearly not comparable to the 
optimized RGAk sequences. 



D. Finite-width and Flip-angle Errors 

As a final comparison, we examine the performance and 
robustness of RGAk, CDD^, and QDDm in the presence of 
both finite- width and flip-angle pulse errors. In particular, we 
focus on the performance as a function of e and Tp relative 
to the size of the pulse-interval = 0.1ns. The flip-angle 
error is varied from zero to a 15% rotation error, the ratio of 
pulse duration to interpulse delay Tp/rd G [10~^, 10^], and 
J = IMHz with 13 = IkHz. We average over 10 realizations 

The performance of RGAk is shown in Fig. 13 in pan- 
els (a)-(d) for K = 4, 16a, 64a, 256a, respectively. Although 
additional optimal configurations were identified at other se- 
quence lengths, we focus on these particular values of K to 
compare them directly to CDD^, r = 1, 2, 3, 4, and QDDm, 
M = 1, 3, 7, 15. RGAk performance is found to be primar- 
ily dependent upon r^, only showing significant e-dependence 
when e < 0.04. Below e = 0.04, specifically in the region 
where Tp < Td, robustness increases as the number of pulses 
is increased from K = 16a to i^^ = 256a. Error accumula- 
tion within this particular range of values appears to not be an 
issue. 

CDD^ performance is displayed in panels (e)-(h) of Fig. 13 
for r = 1,2,3,4, respectively. Contrary to RGAk, CDD^ 
performance exhibits a strong dependence on flip-angle er- 
rors rather than finite pulse duration. Optimal performance is 



FIG. 12. (color online) Performance of RGAr sequences versus 
QDDm as a function of flip-angle error e G [0.01,0.2] averaged 
over 10 realizations of B^. The sequence orders are chosen as 
M = 3, 7, 15 and directly correspond to K = 16, 64, 256. QDD 
is shown to be highly sensitive to flip-angle errors, decreasing in per- 
formance as M grows. Again, robust GA sequences achieve optimal 
performance. See Fig. 1 1 for additional details regarding simulation 
parameters. 

heavily concentrated around small values of e due to the low 
level of robustness exhibited by CDD^ for flip-angle errors. 
Robustness to finite pulse duration appears to be most noti- 
cable for r = 2,4, although performance is still rather poor 
compared to RGAk- 

Lastly, we examine QDDm in panels (i)-(l) in Fig. 13, 
where M = 1,3,7,15, respectively. The lowest sequence 
order, M = 1, generates the exact same sequence as RGA4, 
therefore, performance is identical. The remaining sequence 
orders result in continual error accumulation, which is evi- 
dent from the steady decline in performance from M = 3 
to M = 15. As in the case of CDD^, QDDm performance 
is primarily e-dependent and degrades rapidly with increasing 
e. Similarly to the additional comparisons of numerically op- 
timized sequences and deterministic schemes for faulty DD 
pulses given above, RGAk sequences significantly outper- 
form CDD^ and QDDm- 

VI. EXISTENCE OF DETERMINISTIC STRUCTURE 

Concatenation appears to play an important role in the con- 
struction of optimal DD in the limit of fixed pulse-intervals. 
One of the goals of this study is to determine whether the 
optimal sequences provided above can be generalized into a 
deterministic concatenation scheme for arbitrary order decou- 
pling. As suggested by the analysis summarized in Table II, 
we find that such a scheme is possible by utilizing GA^a as 
the fundamental unit of concatenation in 

GAi'J=GAsa[G4l-% (150) 

where GA^^^ = fr^. Requiring 8^ pulses, GA^^^ achieves 
2qth order error suppression; a factor of 2^ improvement over 
the decoupling efficiency of CDD, which demands 4^^ pulses 



30 




-4 



-2 2 

(a) RGA^ 




-4-2 2 

(e)CDDi 



-4-2 2 

(i)QDDi 





0.14 
0.12 
0.10 
vuO.OS 
0.06 
0.04 
0.02 
0.00 



0.14 
0.12 
0.10 

Vi;0.08 

0.06 
0.04 
0.02 
0.00 



-2 2 

{h)RGAiQa 




-4-2 2 
logioCV^^) 

(f)CDD2 



-4-2 2 

logio(V^rf) 




0.14 
0.12 
0.10 
,0.08 
0.06 
0.04 
0.02 
0.00 



-2 2 

logioC'Z'A^) 

{c)RGAQ^a 




-4-2 2 

logioCV^'/) 

(g)CDD3 



-4-2 2 

lOgl0(T;,/Tj) 

(k)QDD7 



0.14 




0.12 




0.10 




0.08 




0.06 




0.04 




0.02 




0.00 





0.14 
0.12 
0.10 

;0.08 

0.06 
0.04 
0.02 
0.00 



0.14 
0.12 
0.10 

;0.08 

0.06 
0.04 
0.02 
0.00 



-2 2 

logio(T;,/Trf) 

(d)i?GA256a 




-4-2 2 

logioCTp/Trf) 

(h)CDD4 




-4-2 2 

logio(Tp/Trf) 

(DQDDiB 



-7 


logioD 


-1 

























FIG. 13. (color online) Performance of RGAr, CDDr, and QDDm when subjected to finite pulse duration and flip-angle errors. The pulse- 
interval is fixed at Td = 0.1ns, while J = IMHz and /3 = IkHz. RGAr sequences significantly outperform both CDDr and QDDm for 
K = 4, 16, 64, 256. The most notable region of robustness exists for e < 0.04 and Tp < Td for K = 16, 64, 256. 



to achieve an equivalent decoupling order. The increased de- 
coupling efficiency is facilitated by second order error sup- 
pression provided by GA^a, which essentially boosts the effi- 
ciency by a factor of 2 at each level of concatenation. 

In Fig. 14, we compare the performance of GA^g^ to that of 
CDDr and GA4 in the case of zero-width pulses as a func- 
tion of the number of pulses K. The results are averaged 
over 25 random realizations of B^, where the pulse operators 
are designated by {Pi,P2} = {X, F} for each generalized 
sequences. The strengths of the error Hamiltonian and bath 
dynamics are given by J = IMHz and (3 = IkHz, respec- 
tively, and the minimum pulse delay Td = 0.1ns. In com- 



paring q = 1, 2, . . . , 4 and / = 1, 2, . . . , 6, the performance 
of GA^^^ improves dramatically as the level of concatenation 
increases, far exceeding that of CDD^ and GA4. 

However, the truly meaningful test of sequence perfor- 
mance is in the presence of pulse errors. Defining 

RG4'^ = RGA8a[RG4l-% (151) 

to effectively combat the inclusion of flip-angle and finite- 
width pulse errors, we compare the performance of RGA^g^ 
to CDDr and RGA^ in Fig. 15 for {Pi, P2} = {Y, X}. The 
relevant Hamiltonian parameters are equivalent to those cho- 
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FIG. 14. (color online) Comparison of GA^^}, CDD^, and XY^ per- 
formance after one cycle as a function of the number of pulses K 
for ideal, zero- width pulses. The strength of the error Hamiltonian 
and environment dynamics are set to J = IMHz and (3 — IkHz, re- 
spectively, and the minimum pulse-interval ra = 0.1ns. Results are 
averaged over 25 random realizations of B^, where the error bars are 
shown, but quite small. As expected by the results of the GA search 
GAg^\ q= 1, . . . , 4 is indeed superior to CDDr, r = 1, . . . , 6, and 
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FIG. 15. (color online) Comparison of RGA"^^, CDD^, and RGA4 
performance versus the number of pulses K for combined pulse er- 
rors (flip-angle and finite- width) after one cycle. Hamiltonian param- 
eters, J and 13, are the same as those in Fig. 14 with the total cycle 
time fixed at Tc = Ins, as opposed to r^. Results are averaged over 
25 realizations of with e = 0.01 and Tp/rc — 10~^°. The per- 
formance of RGA^Q^ improves as the number of pulses is increased 
within a given cycle time Tc. For the specified parameters, CDD^ 
does not exhibit enhanced performance with increasing concatena- 
tion level. 



sen for the ideal case, while the flip-angle error e = 0.01. 
As opposed to fixing t^, we select a fixed cycle time Tc to 
analyze the relationship between and Td as the number 
of pulses grows. In particular, we consider Tc = Ins and 
^p/^c = 10~^^. We expect robustness to be most noticeable in 
the strong pulse regime, Tp ^ Td, where the primary form of 
pulse error is due to the flip-angle errors. Performing the anal- 
ysis with a fixed cycle time allows us to examine robustness 
as a function of concatenation level and as Tp Td, simul- 
taneously. As expected by direct calculation of the effective 
Hamiltonian for RGA4 [see Eq. (124)], increasing the number 
of pulses via multiple DD cycles does not offer an enhance- 
ment in sequence performance due to an immediate accumu- 
lation of error proportional to Tp and . In contrast, CDD^ 
performance remains fairly consistent and oscillates between 
two values, seemingly dependent on the parity of the concate- 
nation level. In Refs. [23, 25], a similar study of CDD per- 
formance conveyed that effective cancellation of pulse- width 
errors occurs for a range of concatenation levels if the pulse 
width is much smaller than the pulse-interval; eventual satura- 
tion in performance as r increases occurs once this condition 
is violated. The performance characteristics of CDD^ differ 
here due to the presence of flip-angle errors, which are not 
accounted for in Refs. [23, 25]. Combined errors are most 
effectively addressed by RGA^^^ , as can be seen by the im- 
provements in sequence performance as concatenation level 
increases. Note that the performance eventually begins to 
show signs of saturation as the concatenation level increases. 
Essentially, the pulse-interval is approaching a value compara- 
ble to the pulse duration which leads to finite- width errors be- 
coming a more significant decoherence mechanism than flip- 
angle errors or the error Hamiltonian. RGA^a and its concate- 
nated versions do not provide protection against finite- width 



errors, and therefore their performance becomes hindered by 
the presence of terms that are first order in Tp. 

VII. CONCLUSIONS 

In this work we showed that numerically optimal DD se- 
quences can be constructed using a genetic algorithm in 
conjunction with a simulated annealing convergence accel- 
erator and a novel complexity-reduction technique. The 
search focused on sequences containing K = 1,2,..., 256 
pulses, however, we identified optimal performance at K = 
4, 8, 16, 32, 64, 256 and compared each sequence to known 
deterministic schemes, such as CDD and QDD. The ideal- 
pulse analysis showed that, under the constraint of a fixed 
pulse interval, optimal sequences can be constructed which 
outperform CDD, yet fall short of the decoupling efficiency 
realized by QDD. Optimization proved to be quite beneficial 
in the case of finite- width and flip-angle errors, where numer- 
ically optimal sequences obtained a level of robustness that 
could not be reached by either CDD or QDD. 

By analyzing sequence structure, we found that concate- 
nation appears to be the preferred method for high order de- 
coupling for all pulse profiles considered. This technique is 
most beneficial when each concatenation sub-level attains the 
same decoupling order. As most notably observed for faulty 
DD pulses subjected to flip-angle errors [K = 32, 256], con- 
catenating sequences with unequal decoupling orders results 
in considerable attenuations in sequence performance and er- 
ror suppression that can be attributed to the inherent non- 
robustness of the low-order decoupling sequence. 

The culmination of our study is centered around the iden- 
tification of a deterministic sequence structure that obtains a 
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high decoupHng efficiency in the ideal Hmit and robustness 
to errors generated by pulse imperfections. We determined 
that RGAsa [Eq. (105)] is the favored generating sequence 
for a majority of the pulse profiles considered. Concatenating 
this particular sequence, it is possible to suppress the first 2q 
terms in the Magnus expansion using 8^ pulses. Compared 
to the 4^ pulses required for the widely used original version 
of CDD [22], the concatenated version of RGAsa utilizes 2^ 
fewer pulses to obtain the same decoupling order. Although it 
is not possible to obtain such high levels of decoupling in the 
presence of pulse imperfections, RGAga contains an inherent 
robustness that continues to aid the error suppression process 
as the level of concatenation increases. This result is most ap- 
parent in our final study of faulty DD pulses, which includes 
both finite duration and flip-angle errors; we found the con- 
catenated RGAsa construction to be the most robust scheme 
available for fixed pulse-interval DD sequences. 

The importance of optimizing over pulse-intervals was 
clearly displayed in the ideal pulse analysis, where sequence 
configuration optimization alone could not supply the decou- 
pling efficiency achieved by QDD. However, in agreement 
with previous work, we have shown that pulse-interval op- 
timized sequences fail to be robust against additional errors 
generated by faulty DD pulses. Future work should focus on 



extending the search algorithm to incorporate multi-qubit sys- 
tems and unequal pulse delays, to obtain robust DD sequences 
in the presence of various pulse errors. 
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Appendix A: Scaling of Performance 

Here, we prove the scaling of distance measure D{U^Is) 
described in Eq. (26) for the ideal pulse limit. First, we prove 
that the general distance measure D{U^G) [see Eq. (22)] sat- 
isfies 



D{U,G)<^\\U- 



■G^Ib\ 



(Al) 



The upper-bound is obtained by utilizing the steps originally 
talcen in Ref. [69] to obtain the closed form expression of 
D(U^G) given in Eq. (23), where initially it is shown that, 
for <l> satisfying <l>^<l> = 7^, 



D{U,G) 



1 



V^dsdB ^ 



minWU -G^M 



Re{Tr[[/(Gt 0^t)]} 



= min Wl - 

^ V dsdB 



Re{TrB [Tr s[U {G^ ^ I b)W]} 

(A2) 



^ V dsdB 



™^ A/ 1 ~ ~r~r 
^ V dsdB 



Re[Tr(r<l>t)]. 



It is then noticed that computing the minimization problem 
of Eq. (22) is equivalent to finding the maximum value of 
Re[Tr(r$^)] over all unitary ^. In order to complete the 
proof, the singular value decomposition (SVD) F = lySF^ is 
invoked, where V are unitary and S = diag(5i, . . . , Sd^^) 
is a real diagonal matrix containing the singular values si > 
52 > • • • > ^ 0- The relevant expression becomes 

Tr(F<l>'^) = Tr(WS0<l>'^) = Trp(0<l>'^iy)] (A3) 

and the final details of the proof involve showing that 
Re{Tr[S(V^<l>^iy)]} is essentially maximized if and only if 
V^^^W = Ib, or equivalently when ^ = WV^ . Note that 



Re{Trp(Vt<|,t|y)]} < Tr(S) 



(A4) 



holds for all ^ that differ from WV^ . 

It is now straightforward to obtain the bound expressed in 
Eq. (Al) since by choosing <l> = we are satisfying the 
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lower bound expression of Eq. (A4) and generating the upper- 
bound 



D{U,G) < 



1 



^/2dsdi 



\U-G^Ib\\f (A5) 



<—\\U-G^IbI 



(A6) 



Note that we have used the inequaUty \\T\\f < Vdsd^\\T\\ 
in order to transform from the Frobenius norm [Eq. (A5)] to 
the sup-operator norm [Eq. (A6)] . 

Choosing G = i^, as specified by the desired action of the 
DD evolution on the system, and the toggling frame evolu- 
tion operator U = /7o (tc), the distance measure upper-bound 
becomes 



D < ^\\Uo{rc) - Is ^ Ib\ 



(A7) 



Expressing the unitary evolution operator Uo{rc) as a time- 
dependent perturbation expansion 

CO 

Uo{rc) = Is^Ib^Y1 ^o"\rc). (A8) 

n=l 

with Dyson operators 

ut\rc) = {-iT rdh--- dtnflHoitj), (A9) 

^0 Jo 

we obtain, using the triangle inequality, 

^ OO ^ CO 

^ n=l ^ n—1 

The final step is to show that ||t/n('?"c)|| obtains the scaling 
claimed by Eq. (26). This is accomplished by using (1) the 
triangle inequality, (2) sub-multiplicativity, and (3) unitary 
invariance to obtain an upper-bound on Eq. (A9) as follows: 
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< 
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(All) 



As a result, Eq. (A 10) achieves the upper-bound 

1 °° 

^ n=l 



^ CO ^ 1 



Let us now consider the case of A^th order error suppression, 
where all U^\tc) fox n < N vanish. The bound then trun- 
cates to 



D 



1 ^ 



(A13) 



which scalings accordingly as 

D<0[ri^+i(J + /3)^+i], (A14) 
when Jtc <C 1 and <C 1 are satisfied. 



Appendix B: Extracting effective error Hamiltonian scaling 
numerically 



In Sec. IV A, we discussed the scaling of the distance mea- 
sure D for each optimal sequence in the ideal pulse limit with- 
out direct calculation of the effective Hamiltonian. We obtain 
this scaling by assuming that D has the form 



(Bl) 



where N is the decoupling order of the sequence and nj + 
n/3 = A^ + 1. First, the decoupling order is determined by 
examining logio ^ as a function of Td, as this quantity scales 
linearly in with a slope of A^ + 1. The scaling of is 
only dependent upon the decoupling order A", and therefore is 
independent of the relative magnitudes of J and {3. The only 
constraint we consider is Jrd <C 1 and pTd <C 1 in order to 
satisfy the condition ||i^gj^r(i|| <C 1 discussed in Sec. II A for 
effective error suppression. 

The scaling of the remaining quantities, J and /3, is de- 
termined for each relevant parameter regime (J <^ f3 and 
J P)hy analyzing logio ^ as a function of Jrd by vary- 
ing J for fixed Td and (3. The logarithm of the performance 
measure can again be expected to scale linearly in Jrd, now 
with a slope of nj. The value of nj will ultimately depend 
on the magnitude of J relative to (3, however in either case 
n J is well-defined. The scaling of (3 is now determined from 
nj -\- rif^ = N -\- 1, where n/3 is the only unknown quantity. 
Note that this method can be easily extended to include finite- 
width pulses, flip-angle errors, or both, by assuming 



respectively, where Up Ud = nj ^ n^. 



(B2a) 

(B2b) 
(B2c) 



